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HUMAN PRSS11-LIKE S2 SERINE PROTEASE AND USES THEREOF 

FIELD OF THE INVENTION 

This application is a continuation-in-part of U.S. 
5 Patent Application No. 10/189,099 filed July 3, 2002 and 
entitled "Human PRSS11-L Like S2 Serine Protease and Uses 
Thereof" the contents of which is hereby incorporated by 
reference . 

The present invention relates to an S2 serine 
10 protease. In particular, the present invention relates 
to isolated nucleic acid molecules and polypeptides of a 
novel human S2 serine protease PRSS11-L and uses thereof. 

BACKGROUND OF THE INVENTION 

15 Two distinct members of the S2 serine protease 

subfamily have been previously identified in humans. One is 
protein L56/PRSS11 (Zumbrunn et al . , (1996), FEBS Lett. 
398:187-192; Genbank Protein ID: CAA69226) . The other is 
protein 0mi/HtrA2 (Faccio et al . , (2000), J. Biol. Chem. 

20 275:2581-2588, Genbank Protein ID: AAB94569) . These 

proteases share homologues with a bacterial HtrA (high- 
temperature requirement A) endoprotease, which acts as a 
chaperone at low temperatures and as a proteolytic enzyme 
that removes denatured or damaged substrates at elevated 

25 temperatures. The two members of human S2 serine proteases 
share extensive homology at their carboxy termini. 

Human S2 serine proteases are believed to play 
important roles in cellular physiology. PRSS11 is 
upregulated in osteoarthritic cartilage and secreted. It 

30 was suggested that PRSS11 regulates the availability of IGFs 
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by cleaving IGF-binding proteins. HtrA2 is upregulated 
during stress and appears to localize in the endoplasmic 
reticulum. It was shown that HtrA2 regulates apoptosis by 
interacting with the X chromosome -linked inhibitor of 
5 apoptosis (XIAP) (Suzuki et al . , (2001), Mol cell 8:613-21). 

SUMMARY OF THE INVENTION 

The present invention relates to isolated nucleic 
acid molecules encoding a novel human S2 serine protease, 
10 herein referred to as PRSS11-L (PRSSll-Like) , the 
polypeptides encoded by the isolated nucleic acid 
sequences, and the use of the nucleic acid molecules and 
polypeptides thereof . 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts an alignment of the amino acid 
sequences of the PRSS11-L protein and the two other S2 
serine proteases, PRSS11 (Genbank Protein ID: CAA69226) and 
Omi (Genbank Protein ID: AAB9456 9) . The alignment was 

20 performed using the Wisconsin GCG Gap Needleman and Wunsch 
algorithm. The active site residues of the catalytic triad 
are indicated above the sequences by asterisks (*) . 

Figure 2 depicts an alignment of the amino acid 
sequences of the PRSS11-L protein and HtrA3 (Genbank Protein 

25 ID: AAH34390) . The alignment was performed using the 
Wisconsin GCG Gap Needleman and Wunsch algorithm. 

Figure 3 is a schematic representation of the genomic 
structures of the PRSS11-L and HtrA3 long-form. The filled 
black boxes represent coding sequence exons while the filled 

30 bars represent the non-coding exons. The lines between 
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exons represent introns. The numbers correspond to the 
nucleotide positions of the reverse complementary sequence 
to the BAG clone from human chromosome 4 (GenBank Accession 
No. AC113611) . 

5 Figure 4 shows the splice donor/acceptor sites for 

human HtrA3 and PRSS11-L transcripts on the HtrA3 gene. E 
stands for exon. 

DETAILED DESCRIPTION 

10 Definitions 

As used herein, the term "family" refers to two or 
more proteins or nucleic acid. molecules having a common 
structural domain and having the amino acid or nucleotide 
sequence identity defined herein. Family members can be 

15 from either the same or different species. For example, 
a family can comprises two or more proteins of human 
origin, or can comprise one or more proteins of human 
origin and one or more of non-human origin. 

The term "protein superf amily" as used herein refers to 

20 proteins whose evolutionary relationship can not be entirely 
established or can be distant by accepted phylogenetic 
standards yet show similar three dimensional structures and/ 
or display a unique consensus of critical amino acids. 
"Nucleic acid molecule" used herein refers to a 

25 polynucleotide molecule. The term "polynucleotide (s) " 
generally refers to any polyribonucleotide or 
polydeoxribonucleotide, including unmodified RNA or DNA or 
modified RNA or DNA. A used herein, polynucleotide refers 
to, among others, single- and double -stranded DNA, DNA that 

30 is a mixture of single- and double -stranded regions or 
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single-, double- and triple-stranded regions, single- and 
double -stranded RNA, and RNA that is mixture of single- and 
double -stranded regions, hybrid molecules comprising DNA and 
RNA that can be single -stranded or, more typically, double- 
5 stranded, or triple-stranded, or a mixture of single- and 
double- stranded regions. DNAs or RNAs with -backbones 
modified for stability or for other reasons are 
"polynucleotides" as that term is intended herein. 
Moreover, DNAs or RNAs comprising unusual bases, such as 

10 inosine, or modified bases, such as tritylated bases, to 
name just two examples, are polynucleotides as the term is 
used herein. It will be appreciated that a great variety of 
modifications have been made to DNA and RNA that serve many 
useful purposes known to those of skill in the art. The 

15 term polynucleotide as it is employed herein embraces such 
chemically, enzymatically or metabolically modified forms of 
polynucleotides, as well as the chemical forms of DNA and 
RNA characteristic of viruses and cells, including simple 
and complex cells, inter alia. Polynucleotides embraces 

20 short polynucleotides often referred to as 

oligonucleotide (s) . For purposes of this disclosure, the 
term ''nucleic acid" is also construed to include the 
complimentary sequence. 

An "isolated" nucleic acid molecule is one which is 

25 substantially separated from nucleic acid molecules with 
differing nucleic acid sequences. Embodiments of the 
isolated nucleic acid molecule of the invention include cDNA 
as well as genomic DNA and RNA which are all preferably of 
human origin. 
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As used herein, the terms "gene" and "recombinant gene" 
refer to nucleic acid molecules comprising an open reading 
frame encoding a polypeptide. 

The terms a "PRSS11-L gene", "gene for a PRSS11-L 
5 protease", and "gene for a PRSS11-L protein" as used herein, 
all refer to a DNA molecule that encodes a PRSS11-L protein. 
Examples of a "PRSS11-L gene" include DNA molecules 
characterized by a nucleotide sequence that is substantially 
similar to that from nucleotide 1011 to 2015 of SEQ ID NO: 
10 1. 

The term "regulatory region" or "regulatory sequence" 
is intended to include promoters, enhancers and other 
expression control elements (e.g., polyadenylation signals, 
and ribosome binding site (for bacterial expression) and/or, 

15 an operator). Such regulatory sequences are described and 
can be readily determined using a variety of methods known 
to those skilled in the art (see for example, in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990) . Regulatory sequences 

20 include those that direct constitutive expression of a 

nucleotide sequence in many types of host cells and those 
that direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory 
sequences) . 

25 As used herein, the term "allelic variant" refers to a 

nucleotide sequence which occurs at a given locus or to a 
polypeptide encoded by an allelic variant nucleotide 
sequence . 

The term "polypeptide", as used herein, refers to the 
30 basic chemical structure of polypeptides that is well known 
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and has been described in textbooks and other publications 
in the art. In this context, the term is used herein to 
refer to any peptide or protein comprising two or more amino 
acids joined to each other in a linear chain by peptide 
5 bonds. As used herein, the term refers to both short 

chains, which also commonly are referred to in the art as 
peptides, oligopeptides and oligomers, for example, and to 
longer chains, which generally are referred to in the art as 
proteins, of which there are many types. 

10 It will be appreciated that polypeptides often contain 

amino acids other than the 20 amino acids commonly referred 
to as the 20 naturally occurring amino acids, and that many 
amino acids, including the terminal amino acids, can be 
modified in a given polypeptide, either by natural 

15 processes, such as processing and other post- translational 
modifications, but also by chemical modification techniques 
which are well known to the art. Even the common 
modifications that occur naturally in polypeptides are too 
numerous to list exhaustively here, but they are well 

20 described in basic texts and in more detailed monographs, as 
well as in the research literature, and are therefore well 
known to those of skill in the art. Among the known 
modifications which can be present in polypeptides of the 
present invention are, to name an illustrative few, 

25 acetylation, acylation, ADP- ribosylation, amidation, 

covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid 
derivative, covalent attachment of phosphotidylinositol , 

30 cross-linking, cyclization, disulfide bond formation, 
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demethylation, formation of covalent cross- links, formation 
of cystine, formation of pyroglutamate, formylation, gamma - 
carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, 
5 oxidation, proteolytic processing, phosphorylation, 
prenylation, racemization, selenoylation, sulfation, 
transfer-RNA mediated addition of amino acids to proteins 
such as arginylation, and ubiquitination . Such 
modifications are well known to those of skill and have been 

10 described in great detail in the scientific literature. 

Several particularly common modifications, such as 
glycosylation, lipid attachment, sulfation, gamma - 
carboxylation of glutamic acid residues, hydroxylation and 
ADP-ribosylation are described in many basic texts, 

15 including PROTEINS-- STRUCTURE AND MOLECULAR PROPERTIES, 2nd 
Ed., T. E. Creighton, W. H. Freeman and Company, New York 
(1993) . Many detailed reviews are also available on this 
subject, such as, for example, those provided by Wold, F., 
Posttranslational Protein Modifications: Perspectives and 

20 Prospects, pgs. 1-12 in POSTTRANSLATIONAL COVALENT 

MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York (1983); Seifter et al . , (1990), Meth. 
Enzymol. 182, 626-646 and Rattan et al . , "Protein Synthesis: 
Posttranslational Modifications and Aging", (1992) Ann. N.Y. 

25 Acad. Sci. 663, 48-62. 

It will be appreciated, as is well known and as noted 
above, that polypeptides are not always entirely linear. 
For instance, polypeptides can be posttranslationly 
modified, including via natural processing or through human 

30 manipulation. Circular, branched and branched circular 
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polypeptides can be synthesized by non-translation natural 
processes and by entirely synthetic methods, as well. 
Modifications can occur anywhere in a polypeptide, including 
the peptide backbone, the amino acid side-chains and the 
5 amino or carboxyl termini. In fact, blockage of the amino 
or carboxyl group or both in a polypeptide by a covalent 
modification, is common in naturally occurring and synthetic 
polypeptides and such modifications can be present in 
polypeptides of the present invention, as well. For . 

10 instance, the amino terminal residue of polypeptides made in 
E. coli or other cells, prior to proteolytic processing, 
almost invariably will be N-f ormylmethionine . During post- 
translational modification of the peptide, a methionine 
residue at the NH 2 -terminus can be deleted. Accordingly, 

15 this invention contemplates the use of both the methionine- 
containing and the methionineless amino terminal variants of 
the protein of the invention. 

The modifications that occur in a polypeptide often 
will be a function of how it is made. For polypeptides made 

20 by expressing a cloned gene in a host, for instance, the 

nature and extent of the modifications in large part will be 
determined by the host cell posttranslational modification 
capacity and the modification signals present in the 
polypeptide amino acid sequence. For instance, as is well 

25 known, glycosylation often does not occur in bacterial hosts 
such as, for example, E. coli. Accordingly, when 
glycosylation is desired, a polypeptide should be expressed 
in a glycosylating host, generally a eukaryotic cell. 
Insect cells often carry out the same posttranslational 

30 glycosylations as mammalian cells and, for this reason, 
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insect cell expression systems have been developed to 
efficiently express mammalian proteins having native 
patterns of glycosylation, inter alia. Similar 
considerations apply to other modifications. It will be 
5 appreciated that the same type of modifications can be 

present in the same or varying degree at several sites in a 
given polypeptide. Also, a given polypeptide can contain 
many types of modifications. In general, as used herein, 
the term "polypeptide" encompasses all such modifications, 
10 particularly those that are present in polypeptides 

synthesized recombinantly by expressing a polynucleotide in 
a host cell. 

The term "protein domain" as used herein refers to a 
region of a protein having a particular three-dimensional 

15 structure that has functional characteristics independent 
from the remainder of the protein. This structure can 
provide a particular activity to the protein. Exemplary 
activities include, without limitation, enzymatic activity, 
creation of a recognition motif for another molecule, or to 

20 provide necessary structural components for a protein to 
exist in a particular environment. Protein domains are 
usually evolutionarily conserved regions of proteins, both 
within a protein family and within protein superf amilies 
that perform similar functions. 

25 An "isolated" or "purified" protein or biologically 

active portion thereof is substantially free of cellular 
material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or 
substantially free of chemical precursors or other chemicals 

30 when chemically synthesized. The language "substantially 
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free of cellular material" includes preparations of protein 
in which the protein is separated from cellular components 
of the cells from which it is isolated or recombinantly 
produced. Thus, protein that is substantially free of 
cellular material includes preparations of protein having 
less than about 30%, 20%, 10%, or 5% (by dry weight) of 
heterologous protein (also referred to herein as a 
"contaminating protein") . When the protein or biologically 
active portion thereof is recombinantly produced, it is also 
preferably substantially free of culture medium, i.e., 
culture medium represents less than about 20%, 10%, or 5 % 
of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 
chemicals, i.e., it is separated from chemical precursors or 
other chemicals which are involved in the synthesis of the 
protein. Accordingly such preparations of the protein have 
less than about 30%, 20%, 10%, 5% (by dry weight) of 
chemical precursors or compounds other than the polypeptide 
of interest. Isolated biologically active polypeptide can 
have several different physical forms. The isolated 
polypeptide can exist as a full-length nascent or 
unprocessed polypeptide, or as partially processed 
polypeptides or combinations of processed polypeptides. The 
full-length nascent protease PRSS11-L polypeptide can be 
postranslationally modified by specific proteolytic cleavage 
events that results in the formation of fragments of the 
full length nascent polypeptide. A fragment, or physical 
association of fragments can have the full biological 
activity associated with protease PRSS11-L however, the 
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serine protease, or some other proteins; (3) the ability 
to act as a chaperone protein, e.g., to renature 
misfolded proteins and help to restore their function; 
(4) the ability to perform one or more of the described 
functions of the S2 serine protease, such as that of 
human HtrA (Hu et al., (1998), J Biol Chem. 
273 (51) :34406-34412, Suzuki et al.,(2001), Mol Cell 
8:613-21) . 

"Sequence identity or similarity", as known in the art, 
is the relationship between two or more polypeptide 
sequences or two or more polynucleotide sequences, as 
determined by comparing the sequences. In the art, identity 
also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case can be, 
as determined by the match between strings of such 
sequences. To determine the percent identity or similarity 
of two amino acid sequences or of two nucleic acids, the 
sequences are aligned for optimal comparison purposes (e.g., 
gaps can be introduced in the sequence of a first amino acid 
or nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues or 
nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in 
the first sequence is occupied by the same or similar amino 
acid residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical or 
similar at that position. The percent identity or 
similarity between the two sequences is a function of the 
number of identical or similar positions shared by the 
sequences (i.e., % identity = # of identical positions/total 
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# of positions (e.g., overlapping positions) x 100). In one 
embodiment, the two sequences are the same length. 

Both identity and similarity can be readily calculated. 
Methods commonly employed to determine identity or 
similarity between sequences include, but are not limited to 
those disclosed in Carillo et al, (1988), SIAM J. Applied 
Math. 48, 1073. Preferred methods to determine identity are 
designed to give the largest match between the sequences 
tested. Methods to determine identity and similarity are 
codified in computer programs. 

A preferred, non-limiting example of a mathematical 
algorithm utilized for the comparison of two sequences is 
the algorithm of Karlin et al . , (1990), Proc. Natl. Acad. 
Sci. USA 87:2264-2268, modified as in Karlin et al . , (1993), 
Proc. Natl. Acad. Sci. USA 90:5873-5877. Such an algorithm 
is incorporated into the NBLAST and XBLAST programs of 
Altschul et al., (1990), J Mol . Biol 215:403-410. To obtain 
gapped alignments for comparison purposes, Gapped BLAST can 
be utilized as described in Altschul et al . , (1997), Nucleic 
Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be 
used to perform an- iterated search which detects distant 
relationships between molecules. When utilizing BLAST, 
Gapped BLAST, and PSI-Blast programs, the default parameters 
of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Additionally, there 
is the FASTA method (Atschul et al . , (1990), J". Molec. Biol. 
215, 403) , which can be used. 

Another preferred, non-limiting example of a 
mathematical algorithm useful for the comparison of 
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sequences is the algorithm of Myers et al, (1988), CABIOS 
4:11-17. Such an algorithm is incorporated into the ALIGN 
program (version 2.0) which is part of the GCG sequence 
alignment software package (Devereux et al . , (1984), Nucleic 
Acids Research 12 (1) , 3 87) . 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

As used herein, the term "vector" refers to a nucleic 
acid molecule capable of transporting another nucleic acid 
to which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded DNA 
loop into which additional DNA segments can be inserted. 
Another type of vector is a viral vector, wherein additional 
DNA segments can be inserted. Certain vectors are capable 
of autonomous replication in a host cell into which they are 
introduced (e. g., bacterial vectors having a bacterial 
origin of replication and episomal mammalian vectors) . 
Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction 
into the host cell, and thereby are replicated along with 
the host genome. Moreover expression vectors are capable of 
directing the expression of genes to which they are operably 
linked. In general, vectors of utility in recombinant DNA 
techniques are often in the form of plasmids. However, the 
invention is intended to include other types of vectors, 
such as viral vectors (e.g., replication defective 
retroviruses, adenoviruses and adeno-associated viruses) , 
which serve equivalent functions. 
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The term "linker region"' or "linker domain" or similar 
such descriptive terms as used herein refers to one or more 
polynucleotide or polypeptide sequences that are used in the 
construction of a cloning vector or fusion protein. The 
5 function of a linker region can include introduction of 

cloning sites into the nucleotide sequence, introduction of 
a flexible component or space-creating region between two 
protein domains, or creation of an affinity tag to 
facilitate a specific molecule interaction. A linker region 

10 can be introduced into a fusion protein, if desired, during 
polypeptide or nucleotide sequence construction. 

The term "cloning site" or "polycloning site" as used 
herein refers to a region of the nucleotide sequence that 
has one or more available restriction endonuclease consensus 

15 cleavage sequences. These nucleotide sequences can be used 
for a variety of purposes including, but not limited to, 
introduction of these sequences into DNA vectors to create 
novel fusion proteins, or to introduce specific site- 
directed mutations. It is well known by those of ordinary 

20 skill in the art that cloning sites can be engineered at a 
desired location by silent mutation, conserved mutation, or 
introduction of a linker region that contains desired 
restriction endonuclease recognition sequences. It is also 
well known by those of ordinary skill in the art that the 

25 precise location of a cloning site can be engineered into 
any location in ^a nucleotide sequence. 

The term "tag" as used herein refers to an amino acid 
sequence or a nucleotide sequence that encodes an amino acid 
sequence that facilitates isolation, purification or 

30 detection of a protein containing the tag. A wide variety 
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of such tags are known to those skilled in the art and are 
suitable for use in the present invention. Suitable tags 
include, but are not limited to, HA peptide, polyhistidine 
peptides, biotin/avidin, and a variety of antibody epitope 
binding sites. 

As used herein, the term "host cell" refers to a cell 
that contains a DNA molecule of the invention either on a 
vector or integrated into a cell chromosome. 

As used herein, a "recombinant host cell" is a cell 
which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 
As used herein, a cell has been "transformed" by exogenous 
DNA when such exogenous DNA has been introduced inside the 
cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the 
genome of the cell. In prokaryotes and yeasts, for example, 
the exogenous DNA may be maintained on an episomal element, 
such as a plasmid. With respect to eukaryotic cells, a 
stably transformed or transfected cell is one in which the 
exogenous DNA has become integrated into the chromosome so 
that it is inherited by daughter cells through chromosome 
replication. This stability is demonstrated by the ability 
of the eukaryotic cell to establish cell lines or clones 
comprised of a population of daughter cells containing the 
exogenous DNA. Recombinant host cells may be prokaryotic or 
eukaryotic, including but not limited to bacteria such as E. 
coli, fungal cells such as yeast, mammalian cells including 
but not limited to cell lines of human, bovine, porcine, 
monkey and rodent origin, and insect cells including but not 
limited to Drosophila and silkworm derived cell lines. It is 
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further understood that the term "recombinant host cell" 
refers not only to the particular subject cell but also to 
the progeny or potential progeny of such a cell. Because 
certain modifications can occur in succeeding generations 
5 due to either mutation or environmental influences, such 
progeny may not, in fact, be identical to the parent cell, 
but are still included within the scope of the term as used 
herein. 

As used herein, a "clone" is a population of cells 

10 derived from a single cell or common ancestor by mitosis. A 
"cell line" is a clone of a primary cell that is capable of 
stable growth in vitro for many generations. 

As used herein, the term "a biological sample" can be 
tissues, cells and biological fluids isolated from a 

15 subject, as well as tissues, cells and fluids present within 
a subject. In one embodiment, the biological sample 
contains protein molecules from the subject. The protein 
molecules may or may not remain their native biological 
activities. In another embodiment, the biological sample 

20 contains nuclei acid including genomic DNA, and/or mRNA 

molecules from the test subject or from the subject. In yet 
another embodiment, the biological sample contains both 
proteins and nucleic acid molecules. 

As used herein, the term "a disorder related to PRSS11- 

25 L" shall include a disorder or disease associated with 
overactivity or insufficient activity of PRSS11-L, and 
conditions that accompany this disorder or disease. 

The term "overactivity of PRSS11-L" refers to either 1) 
PRSS11-L expression in cells which normally do not express 

30 PRSS11-L; 2) increased PRSS11-L expression; or 3) mutations 
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leading to constitutive activation of one or more PRSS11-L 
biological activities. 

The term "insufficient activity of PRSS11-L" refers to 
either 1) the absence of PRSS11-L expression in cells which 
5 normally express PRSS11-L; 2) decreased PRSS11-L expression;, 
or 3) mutations leading to constitutive inactivation of one 
or more PRSS11-L biological activities. 

The term "subject" as used herein, refers to an animal, 
preferably a mammal, most preferably a human, who has been 
10 the object of treatment, observation or experiment. 

Unless defined otherwise, all technical and scientific 
terms used herein have the same meaning as is commonly 
understood by one of skill in the art. 

Protease PRSSll-Like Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode a PRSSll-Like protease or 
a biologically active portion thereof, as well as nucleic 
acid molecules of at least 12 sequential nucleotides in 
length for use as hybridization probes or PCR primers, to 
identify or amplify nucleic acid molecules encoding a 
PRSS11-L polypeptide of the invention. 
Isolated Nucleic Acid Molecule of The Invention 

In one embodiment, the invention provides an isolated 
nucleic acid molecule capable of encoding a S2 serine 
protease, comprising a nucleotide sequence encoding a 
polypeptide which has at least a 90% sequence identity to 
amino acids 1 through 9 of SEQ ID NO: 2, or a complement 
thereof. A nucleic acid molecule which is a complement of a 
given nucleotide sequence is one which is sufficiently 
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In a preferred embodiment, the invention provides an 
isolated nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO: 1, or a complement thereof. 

In another preferred embodiment, the invention provides 
an isolated nucleic acid molecule comprising a nucleotide 
sequence encoding a polypeptide of SEQ ID NO: 2, or a 
complement thereof. For example, the invention provides an 
isolated nucleic acid molecule that comprises a nucleic acid 
molecule which is a degenerate variant of PRSS11-L DNA as 
set forth in SEQ ID NO: 1, or a complement thereof. It is 
known that more than one genetic codon can be used to encode 
a particular amino acid, and therefore, the amino acid 
sequence of PRSS11-L protein SEQ ID NO: 2 can be encoded by 
any of a set. of similar DNA molecules. Only one member of 
the set will be identical to PRSS11-L DNA as set forth in 
SEQ ID NO: 1; however all variants hereinafter referred to 
as degenerate variants are contemplated within the scope of 
this invention. Herein, a nucleic acid molecule bearing one 
or more alternative codons which encodes a polypeptide with 
amino acid sequence set forth as SEQ ID NO: 2, is defined as 
a degenerate variant of PRSS11-L DNA as set forth in SEQ ID 
NO: 1. 

Particularly preferred in this regard are natural 
allelic variants of PRSS11- L nucleic acid molecules. DNA 
sequence polymorphisms that lead to changes in the amino 
acid sequence can exist within a population (e.g., the human 
population) . Such genetic polymorphisms can exist among 
individuals within a population due to natural allelic 
variation. An allele is one of a group of genes which occur 
alternatively at a given genetic locus. Such natural 
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allelic variations can typically result in 1-5% variance in 
the nucleotide sequence of a given gene. Alternative 
alleles can be identified by sequencing the gene of interest 
in a number of different individuals, or by using 
5 hybridization probes to identify the same genetic locus in a 
variety of individuals. 

Further particularly preferred in this regard are 
nucleic acid molecules having any and all such nucleotide 
variations that are not known to occur naturally and which 

10 encode polypeptides having properties that are different 
than, but still maintain the functional activity of, the 
naturally occurring PRSS11-L protein. In addition to a 
naturally occurring variant such as a naturally occurring 
allelic variant, a variant of the polynucleotide or 

15 polypeptide can also be a variant that is not known to occur 
naturally. DNA sequences can be altered manually so as to 
code for a peptide having properties that are different from 
those of the naturally occurring peptide. Methods of 
altering the DNA sequences include, but are not limited to, 

20 site directed mutagenesis, chimeric substitution, and gene 

fusions. Site-directed mutagenesis is used to change one or 
more DNA residue and can result in one or more silent 
mutations, conservative mutations, or nonconservative 
mutations. Chimeric genes are prepared by swapping domains 

25 of similar or different genes to replace similar domains in 
the protease PRSS11-L gene. Similarly, fusion genes can be 
prepared that add domains to the protease PRSS11-L gene, 
such as an affinity tag to facilitate identification and 
isolation of the gene and protein, or a signal sequence to 

30 make PRSS11-L a secreted protein. 
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The variants of the PRSS11-L nucleic acid molecule of 
the invention are capable of hybridizing to SEQ ID NO: 1 
under high stringent hybridization condition. 
Isolation of PRSS11-L Nucleic Acid Molecules 
5 The present invention provides methods of isolating 

PRSS11-L nucleic acid molecules. Because PRSS11-L is 
widely expressed in several tissues (Example 3), many 
cells and cell lines other than prostate cells can also 
be suitable as a source for the isolation of PRSS11-L 

10 cDNA. Selection of suitable cells as a cDNA source can 
be done by screening for the presence of PRSS11-L 
transcripts in cell extracts or in whole cell assays by 
Northern analysis using primers that hybridize 
specifically to PRSS11-L transcript. Cells that possess 

15 PRSS11-L transcript, preferably high levels of PRSS11-L 
transcript, are suitable for the isolation of PRSS11-L 
cDNA or mRNA. 

Any of a variety of procedures known in the art can be 
used to isolate a nucleic acid molecule encoding a PRSS11-L 

20 protein. For example, using cDNA or genomic DNA libraries, 
or total mRNA from the suitable cells identified above as a 
template and appropriate oligonucleotide as primers, a 
nucleic acid molecule of the invention can be amplified 
according to standard PCR amplification techniques. The 

25 nucleic acid so amplified from PCR can be cloned into an 
appropriate vector and characterized by DNA sequence 
analysis. The ordinarily skilled artisan will appreciate 
that oligonucleotides comprising at least 12 contiguous 
nucleotides of SEQ ID N0:1 are particularly useful as 

30 primers. The primers can be prepared by standard synthetic 
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techniques, e.g., using an automated DNA synthesizer. 
Particularly preferred primers are those that can be used to 
detect PRSS11-L gene but do not bind to other known S2 
serine protease genes, such as pPRSSll or HtrA2 . 
5 Another method to isolate PRSS11-L nucleic acid 

molecules is to probe a genomic or cDNA library, or total 
mRNA with one or more natural or artificially designed probe 
using procedures recognized by those familiar with the art. 
See, e.g., "Current Protocols in Molecular Biology", Ausubel 

10 et al. (eds.) Greene Publishing Association and John Wiley . 
Interscience, New York, 1989,1992. The ordinarily skilled 
artisan will appreciate that SEQ ID NO: 1 or fragments 
thereof comprising at least 12 contiguous nucleotides are 
particularly useful probes. Preferred probes will have at 

15 least 30 bases. Particularly preferred probes will have 50 
or less bases. It is also appreciated that such probes can 
be and are preferably labeled with an analytically 
detectable reagent to facilitate identification of the 
probe. Useful reagents include, but are not limited to, 

20 radioisotopes, fluorescent dyes or enzymes capable of 
catalyzing the formation of a detectable product. The 
probes enable the ordinarily skilled artisan to isolate 
complementary copies of genomic DNA, cDNA or RNA 
polynucleotides encoding PRSS11-L proteins from human, 

25 mammalian or other animal sources or to screen such sources 
for related sequences, e.g., additional members of the 
family, type and/or subtype, including transcriptional 
regulatory and control elements as well as other stability, 
processing, translation and tissue specificity-determining 

30 regions from 5' and/or 3' regions relative to the coding 
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sequences disclosed herein, all without undue 
experimentation . 

Another method to prepare nucleic acid molecules 
corresponding to all or a portion of a nucleic acid molecule 
of the invention is by standard synthetic techniques, e.g., 
using an automated DNA synthesizer. 

Another method to isolate PRSS11-L nucleic acid 
molecules is by using a reverse genetics method. In this 
example, the protease PRSS11-L is purified and the partial 
amino acid sequence is determined by automated amino acid 
sequenators. It is not necessary to determine the entire 
length of the amino acid sequence, but the linear sequence 
of two regions of at least about 4 to 8 amino acids from the 
protein is necessary for the production of primers for PCR 
amplification of a partial PRSS11-L DNA fragment. Once 
suitable amino acid sequences have been identified, the DNA 
sequences capable of encoding them can either be synthesized 
by standard synthetic techniques, e.g., using an automated 
DNA synthesizer, or by degenerate PCR according to 
established PCR amplification techniques using cDNA or 
genomic DNA libraries, or total mRNA as a template and to 
produce 15 to 3 0 degenerate oligonucleotides deduced from 
the amino acid sequence as primers. Because more than one 
genetic codon can be used to encode a particular amino acid, 
the amino acid sequence can be encoded by any of a set of 
similar DNA oligonucleotides. Thus, the frequency of codon 
usage in a particular host is taken into account in 
designing the degenerate PCR primers based on the known 
amino acid sequence. Often, each primer contains a pool of 
oligonucleotides to encourage specific hybridization to the 
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template DNA. Although maximally only one member of the 
pool will be identical to the protease PRSS11-L sequence and 
will be capable of hybridizing to protease PRSS11-L DNA, 
slightly mismatched primers are also able to hybridize to 
5 the protease PRSS11-L DNA under moderately stringent 

hybridization conditions to initiate a PCR amplification 
reaction. 

Preparation of cDNA libraries from the identified 
source cell can also be performed by standard techniques 

10 well known in the art. Well known cDNA library 

construction techniques can be found for example, in 
Maniatis et al., Molecular Cloning: A Laboratory Manual, 
Second Edition (Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York, 198 9) . 

15 Isolation of total mRNA from the identified source 

cell can be performed by standard techniques well known 
in the art. Well known techniques of total mRNA 
isolation can be found for example, in Maniatis et al . , 
supra . 

20 Construction of genomic DNA libraries can be 

performed by standard techniques well known in the art. 
Well known genomic DNA library construction techniques 
can be found in Maniatis et al . , supra. 

25 Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, 

preferably expression vectors, containing a nucleic acid of 

the present invention. 

Recombinant Expression Vectors 
30 The recombinant expression vectors of the invention 
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comprise a nucleic acid of the invention in a form suitable 
for expression of the nucleic acid in a host cell. This 
means that the recombinant expression vectors include one or 
more regulatory sequences. The sequences can be selected on 
5 the basis of the host cells to be used for expression. The 
regulatory sequences are operably linked to the nucleic acid 
sequence to be expressed. Within a recombinant expression 
vector, "operably linked'' is intended to mean that the 
nucleotide sequence of interest is linked to the regulatory 
10 sequence (s) in a manner which allows for expression of the 
nucleotide sequence (e.g., in an in vitro 

transcription/translation system or in a host cell when the 
vector is introduced into the host cell) . It will be 
appreciated by those skilled in the art that the design of 

15 the expression vector can depend on such factors as the 
choice of the host cell to be transformed, the level of 
expression of protein desired, etc. The expression vectors 
of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion 

20 proteins or peptides, encoded by nucleic acids as described 
herein. 

The recombinant expression vectors of the invention can 
be designed for expression of a polypeptide of the invention 
in prokaryotic (e.g., E. coli) or eukaryotic cells (e.g., 

25 insect cells (using baculovirus expression vectors) , yeast 
cells or mammalian cells) . Suitable host cells are known to 
those skilled in the art. Alternatively, the recombinant 
expression vector can be transcribed and translated in 
vitro, for example using T7 promoter regulatory sequences 

30 and T7 polymerase. 



ORT-1644CIP 



Expression of proteins in prokaryotes is most often 
carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non- fusion proteins. 
5 Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve 
four purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 

10 protein; 3) to aid in the purification of the recombinant 
protein by acting as a ligand in affinity purification; 
and/or 4) to facilitate detection of the recombinant 
protein by serving as a marker. Often, in fusion 
expression vectors, a proteolytic cleavage site is 

15 introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the 
recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and 
their cognate recognition sequences, include Factor Xa, 

20 thrombin and enterokinase . Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith et 
al., (1988), Gene 67:31-40), pMAL (New England Biolabs, 
Beverly, MA) , pRITS (Pharmacia, Piscataway, NJ) , or pQE 
(Qiagen) , which fuse glutathione S-transf erase (GST) , 

25 maltose binding protein, protein A, or poly-His, 
respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli 
expression vectors include pTrc (Amann et al . , (1988), Gene 
69:301-315) and pETIId (Studier et al . , Gene Expression 

30 Technology: Methods in Enzymology 185, Academic Press, San 
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Diego, California (1990) 60-89) . One strategy to maximize 
recombinant protein expression in E. coli is to express the 
protein in a host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein. Another 
strategy is to alter the nucleic acid sequence of the 
nucleic acid to be inserted into an expression vector so 
that the individual codons for each amino acid are those 
preferentially utilized in E. coli. Such alteration of 
nucleic acid sequences of the invention can be carried out 
by standard DNA synthesis techniques. 

In another embodiment, the expression vector is a yeast 
expression vector. Examples of vectors for expression in 
yeast S. cerivisae include pYepSecl (Baldari et al., (1987), 
EMBO J 6:229-234), pMFa (KurJan et al., (1982), Cell 30:933- 
943), pJRY88 (Schultz et al., (1987), Gene 54:113-123), 
pYES2 (Invitrogen Corporation, San Diego, CA) , and pPicZ or 
Pichia (Invitrogen Corp, San Diego, CA) . 

Alternatively, the expression vector is a baculovirus 
expression vector. Baculovirus vectors available for 
expression of proteins in cultured insect cells include, but 
are not limited to, the pAc series (Smith et al., (1983), 
Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow et 
al. , (1989) , Virology 170:31-3 9) . 

Alternatively, the expression vector is an insect 
expression vector. A variety of insect cell expression 
vectors can be used to express recombinant protease PRSS11-L 
in insect cells. Commercially available insect cell 
expression vectors useful for recombinant protease PRSS11-L 
expression, include, but are not limited to, pBlueBacII 
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(Invitrogen) . 

In yet another embodiment, the expression vector is a 
mammalian expression vector. When used in mammalian cells, 
the expression vector's control functions are often provided 
5 by viral regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. Examples of mammalian 
expression vectors include, but are not limited to, 
pCDM8 (Seed (1987) Nature 329:840) and pMT2PC (Kaufinan et 

10 al., (1987), EMBO J 6:187-195). Commercially available 
mammalian expression vectors which can be suitable for 
recombinant protease PRSS11-L expression, include, but are 
not limited to, pMAMneo (Clontech) , pcDNA3 (Invitrogen), 
pMClneo (Stratagene) , pXTl (Stratagene) , pSG5 (Stratagene) , 

15 EBO-pSV2-neo (ATCC 37593) pBPV-l(8-2) (ATCC 37110), pdBPV- 
MMTneo (342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo 
(ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), 
and 1ZD35 (ATCC 37565) . 

In another embodiment, the recombinant mammalian 

20 expression vector is capable of directing expression of the 
nucleic acid preferentially in a particular cell type (e.g., 
tissue-specific regulatory elements are used to express the 
nucleic acid) . Tissue-specific regulatory elements are 
known in the art. Non-limiting examples of suitable tissue- 

25 specific promoters include the albumin promoter (liver- 
specific; Pinkert et al . , (1987), Genes Dev. 1:268-277), 
lymphoid-specif ic promoters (Calame et al, (1988) , Adv. 
Immunol. 43:235-275), in particular promoters of T cell 
receptors (Winoto et al, (1989), EMBO J 8:729-733) and 

30 immunoglobulins (BaneiJi et al . , (1983), Cell 33:729- 740; 
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Queen et al., (1983), Cell 33:741-748), neuron-specific 
promoters (e.g., the neurofilament promoter; Byrne et al . , 
(1989), Proc. Natl. Acad. Sci. USA 86:5473- 5477), pancreas- 
specific promoters (Edlund et al .' , (1985), Science 230:912- 
916), and mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,316 and European Application 
Publication No. 264,166). Developmentally-regulated 
promoters also include, for example, the marine hox 
promoters (Kessel et al . , (1990), Science 249:374-379) and 
the beta-fetoprotein promoter (Campes et al . , (1989), Genes 
Dev. 3 :537-546) . 

The invention further provides a recombinant expression 
vector comprising a DNA molecule of the invention cloned 
into the expression vector in an antisense orientation. 
That is, the DNA molecule is operably linked to a regulatory 
sequence in a manner which allows for expression (by 
transcription of the DNA molecule) of an RNA molecule which 
is antisense to the mRNA encoding a polypeptide of the 
invention. Regulatory sequences operably linked to a 
nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 
specific or cell type specific expression of antisense RNA. 
The antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in which 
antisense nucleic acids are produced under the control of a 
high efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is 
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introduced. For a discussion of the regulation of gene 
expression using antisense genes see Weintraub et al . 
(Reviews - Trends in Genetics, Vol. 1(1) 1986). 

Specifically designed vectors allow the shuttling of 
5 DNA between hosts such as bacteria-yeast or bacteria-animal 
cells or bacteria- fungal cells or bacteria- invertebrate 
cells. Numerous cloning vectors are known to those of skill 
in the art and. the selection of an appropriate cloning 
vector is a matter of choice. For other suitable expression 
10 systems for both prokaryotic and eukaryotic cells see 
chapters 16 and 17 of Maniatis et al . , supra. 
Recombinant Host Cells 

Another aspect of the invention pertains to 
recombinant host cells into which a recombinant 
15 expression vector of the invention has been introduced. 

Cell lines derived from mammalian species which can 
be suitable for transfection and which are commercially 
available, include, but are not limited to, CV-1 (ATCC 
CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 
20 CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC 

CRL 1658), HeLa (ATCC CCL 2), C127I (ATCC CRL 1616), BS- 
C-l (ATCC CCL 26), MRC-5 (ATCC CCL 171), Drosophila or 
murine L-cells, and HEK-293 (ATCC CRL1573) , and monkey 
kidney cells. 

25 Vector DNA can be introduced into prokaryotic or 

eukaryotic cells via conventional transformation or 
transfection techniques. As used herein, the terms 
"transformation" or "transfection" refers to a process by 
which cells take up foreign DNA and may or may not integrate 

30 that foreign DNA into their chromosome. Transfection can be 
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accomplished, for example, by various techniques including 
calcium phosphate or calcium chloride co-precipitation, 
DEAE-dextran-mediated transf ection, lipofection, or 
electroporation, protoplast fusion. Suitable methods for 
5 transforming or transfecting host cells can be found in 
Maniatis et al . (supra), and other laboratory manuals. 

For stable transf ection of mammalian cells, it is known 
that, depending upon the expression vector and transf ection 
technique used, only a small fraction of cells may integrate 

10 the foreign DNA into their genome. In order to identify and 
select these integrants, a gene that encodes a selectable 
marker (e.g., for resistance to antibiotics) is generally 
introduced into the host cells along with the gene of 
interest. Preferred selectable markers include those which 

15 confer resistance to drugs, such as G418, hygromycin and 

methotrexate. Cells stably transf ected with the introduced 
nucleic acid can be identified by drug selection (e.g., 
cells that have incorporated the selectable marker gene will 
survive, while the other cells die) . 

20 In another embodiment, the expression characteristics 

of an endogenous (e.g., PRSS11-L) nucleic acid within a 
cell, cell line or microorganism, can be modified by 
inserting a DNA regulatory element heterologous to the 
endogenous gene of interest into the genome of a cell, a 

25 stable cell line or a cloned microorganism such that the 
inserted regulatory element is operatively linked with the 
endogenous gene (e.g., PRSS11-L) and controls, modulates or 
activates the endogenous gene. For example, endogenous 
PRSS11-L which is normally "transcriptionally silent", i.e., 

30 PRSS11-L gene which is normally not expressed, or is 
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expressed only at very low levels in a cell line or 
microorganism, can be activated by inserting a regulatory 
element which is capable of promoting the expression of a 
normally expressed gene product in that cell line or 
5 microorganism. Alternatively, transcriptionally silent, 

endogenous PRSS11-L genes can be activated by insertion of a 
promiscuous regulatory element that is active across cell 
types. 

A heterologous regulatory element can be inserted into 

10 a stable cell line or cloned microorganism, such that it is 
operatively linked with and activates expression of 
endogenous PRSS11-L genes, using techniques, such as 
targeted homologous recombination, which is well known to 
those of skill in the art, and described e.g., in Chappel, 

15 U.S. Patent No. 5,272,071; PCT publication No. WO 91/06667, 
published May 16, 1991. 

The recombinant host cells of the invention can also be 
used to produce nonhuman transgenic animals. For example, 
in one embodiment, a host cell of the invention is a 

20 fertilized oocyte or an embryonic stem cell into which a 
sequence encoding a polypeptide of the invention has been 
introduced. Such host cells can then be used to create non- 
human transgenic animals in which exogenous sequences 
encoding a polypeptide of the invention have been introduced 

25 into their genome or homologous recombinant animals in which 
endogenous sequences encoding a polypeptide of the invention 
have been altered. Such animals are useful for studying the 
function and/or activity of the polypeptide and for 
identifying and/or evaluating modulators of polypeptide 

30 activity. As used herein, a "transgenic animal" is a non- 
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human animal, preferably a mammal, more preferably a rodent 
such as a rat or mouse, in which one or more of the cells of 
the animal includes a transgene. Other examples of 
transgenic animals include non-human primates, sheep, dogs, 
5 cows, goats, chickens, amphibians, etc. A transgene is 

exogenous DNA which is integrated into the genome of a cell 
from which a transgenic animal develops and which remains in 
the genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell 
10 types or tissues of the transgenic animal. Clones of the 
non-human transgenic animals can also be produced according 
to the methods described in Wilmut et al . , (1997), Nature 3 
8 5:8 10- 813 and PCT Publication NOS. WO 97/07668 and WO 
97/07669. 

15 

Protease PRSSll-Like Polypeptides 

One other aspect of the invention pertains to 

substantially purified PRSS11-L polypeptides. 

Isolated Polypeptides of The Present Invention 
20 In one embodiment, the invention provides a 

substantially purified polypeptide having S2 serine protease 

activity, comprising an amino acid sequence having at least 

a 90% and preferably a 95% sequence identity to amino acid 1 

to 9 of SEQ ID NO: 2. The identity and similarity between 
25 two polypeptides can be determined using the method 

described supra. 

In a preferred embodiment, the polypeptides of the 

present invention comprise an amino acid sequence that is 

identical to SEQ ID NO: 2. 
30 The isolated polypeptide of the invention includes 
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chimeric or fusion proteins. As used herein, a "chimeric 
protein" or "fusion protein" comprises all or part 

(preferably biologically active) of a polypeptide of the 
invention operably linked to a heterologous polypeptide 

(i.e., a polypeptide other than the same polypeptide of the. 
invention) . Within the fusion protein, the term "operably 
linked" is intended to indicate that the polypeptide of the 
invention and the heterologous polypeptide are fused in- 
frame to each other. The heterologous polypeptide can be 
fused to the N-terminus or C-terminus of the polypeptide of 
the invention. 

One useful fusion protein is a HAHis fusion protein in 
which the polypeptide of the invention is fused at the C- 
terminus to a tag made of HA and poly His. Such fusion 
proteins facilitate the detection and purification of a 
recombinant polypeptide of the invention. 

In another embodiment, the fusion protein contains a 
heterologous signal sequence at its N-terminus. A signal 
sequence from another protein, for example, the gp67 signal 
sequence of the baculovir-Us envelope protein (Current 
Protocols in Molecular Biology, Ausubel et al . , eds . , John 
Wiley Sc Sons, 1992), the signal sequences of melittin and 
human placental alkaline phosphatase (Stratagene; La Jolla, 
California) , and useful prokaryotic signal sequences include 
the phoA secretory signal and the protein A secretory 
signal (Pharmacia Biotech; Piscataway, New Jersey) , can all 
be used as a heterologous signal sequence to direct the 
secretion of PRSS11-L. 

In yet another embodiment, the fusion protein is an 
immunoglobulin fusion protein in which all or part of a 
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polypeptide of the invention is fused to sequences derived 
from a member of the immunoglobulin protein family. The 
immunoglobulin fusion proteins of the invention can be 
incorporated into pharmaceutical compositions and 
5 administered to a subject to inhibit an interaction between 
a ligand (soluble or membrane -bound) and a protein on the 
surface of a cell (receptor) , to thereby suppress signal 
transduction in vivo. The immunoglobulin fusion protein can 
be used to affect the bioavailability of a cognate ligand of 

10 a polypeptide of the invention. Inhibition of 

ligand/receptor interaction can be useful therapeutically, 
both for treating proliferative and dif f erentiative 
disorders and for modulating (e.g. promoting or inhibiting) 
cell survival. Moreover, the immunoglobulin fusion proteins 

15 of the invention can be used as immunogens to produce 

antibodies directed against a polypeptide of the invention 
in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of 
receptors with ligands. 

20 Expression And Isolation of Protease PRSS11-L 

The invention pertains to methods of expressing or 
isolating a polypeptide of the invention. 

As used herein, "recombinant" polypeptides refer to 
polypeptides produced by recombinant DNA techniques; i.e., 

25 produced from cells transformed by an exogenous DNA 

construct encoding the desired polypeptide. "Synthetic" 
polypeptides are those prepared by chemical synthesis. 

In one embodiment, the polypeptide can be isolated 
from cells or tissue sources that express it naturally by 

30 an appropriate purification scheme using standard protein 
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purification techniques. In another embodiment, 
polypeptides of the invention are produced by recombinant 
DNA techniques. Alternatively, a polypeptide of the 
invention can be synthesized in an in vitro translation 
and/or transcription system. Further alternatively, a 
polypeptide of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 

Polypeptides of the invention can be recombinantly 
expressed by cloning DNA molecules of the invention into 
an expression vector described supra, introducing such a 
vector into prokaryotic or eukaryotic host cells 
described herein, and growing the host cell under 
conditions suitable for production of recombinant 
protease PRSS11-L protein. The expression vector- 
containing cells are clonally propagated and individually 
analyzed to determine whether they produce protease 
PRSS11-L protein. Identification of protease PRSS11-L 
expressing host cell clones can be done by several means, 
including, but not limited to, immunological reactivity 
with anti-protease PRSS11-L antibodies, and the presence 
of host cell-associated protease PRSS11-L activity. The 
selection of the appropriate growth conditions and 
recovery methods are within the skill of the art. 
Techniques for recombinantly expressing a polypeptide are 
fully described in Maniatis, T, et al . , supra, and are 
well known in the art. 

Polypeptides of the invention can also be produced 
using an in vitro translation and/or transcription 
system. Such methods are known to those skilled in the 
art. For example, synthetic PRSS11-L mRNA or mRNA 
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isolated from protease PRSS11-L producing cells can be 
efficiently translated in various cell-free systems, 
including, but not limited to, wheat germ extracts and 
reticulocyte extracts. Alternatively, the coding 
sequence of PRSS11-L cDNA can be cloned under the control 
of a T7 promoter. Then, using this construct as the 
template, PRSS11-L protein can be produced in an in vitro 
transcription and translation system, for example using a 
TNT T7 coupled Reticulocyte Lysate System such as that 
commercially available from Promega (Madison, WI) . 

Polypeptides of the invention can also be produced 
by chemical synthesis, such as solid phase peptide 
synthesis on an automated peptide synthesizer, using 
known amino acid sequences or amino acid sequences 
derived from the DNA sequence of the genes of interest. 
Such methods are known to those skilled in the art. 

Following expression of protease PRSS11-L in a 
recombinant host cell, protease PRSS11-L protein can be 
recovered to provide purified protease PRSS11-L in active 
form. Such methods are known to those skilled in the 
art. For example, protease PRSS11-L from natural host 
cells, or recombinant protease PRSS11-L from recombinant 
host can be purified from cell lysates and extracts, or 
from conditioned culture medium, by various combinations 
of, or individual application of salt fractionation, ion 
exchange chromatography, size exclusion chromatography, 
hydroxylapatite adsorption chromatography and hydrophobic 
interaction chromatography, lectin chromatography, HPLC, 
and FPLC, and antibody/ligand affinity chromatography. 

Protease PRSS11-L can be separated from other cellular 
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term "antibody" as used herein refers to immunoglobulin 
molecules and immunologically active portions of 
immunoglobulin molecules, i.e., molecules that contain an 
antigen binding site which specifically binds an antigen, 
such as a polypeptide of the invention, e.g., an epitope of 
a polypeptide of the invention. A molecule which 
specifically binds to a given polypeptide of the invention 
is a molecule which binds the polypeptide, but does not 
substantially bind other molecules in a sample, e.g., a 
biological sample, which naturally contains the polypeptide. 
Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab) 2 fragments 
which can be generated by treating the antibody with an 
enzyme such as pepsin. 

In various embodiments, the substantially purified 
antibodies of the invention, or fragments thereof, can be 
human, non-human, chimeric and/or humanized antibodies. 
Such antibodies of the invention can be, but are not limited 
to, goat, mouse, rat, sheep, horse, chicken, or rabbit, 
antibodies. In addition, the invention provides polyclonal 
and monoclonal antibodies. The term "monoclonal antibody" or 
"monoclonal antibody composition", as used herein, refers to 
a population of antibody molecules that contain only one 
species of an antigen binding site capable of immunoreacting 
with a particular epitope. The term "polyclonal antibody" 
refers to , antibodies directed against a polypeptide or 
polypeptides of the invention capable of immunoreacting with 
more than one epitopes. Particularly preferred polyclonal 
antibody preparations are ones that contain only antibodies 
directed against a polypeptide or polypeptides of the 
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Such acceptable adjuvants include, but are not limited to, 
Freund's complete, Freund's incomplete, alum-precipitate, 
water in oil emulsion containing Corynebacterium parvum and 
tRNA. The initial immunization consists of protease PRSS11- 
L in, preferably, Freund's complete adjuvant at multiple 
sites either subcutaneously (SC) , intraperitoneally (IP) or 
both. Each animal is bled at regular intervals, preferably 
weekly, to determine antibody titer. The animals may or may 
not receive booster injections following the initial 
immunizaiton. Those animals receiving booster injections 
are generally given an equal amount of the antigen in 
Freund's incomplete adjuvant by the same route. Booster 
injections are given at about three-week intervals until 
maximal titers are obtained. At about 7 days after each 
booster immunization or about weekly after a single 
immunization, the animals are bled, the serum collected, and 
aliquots are stored at about -20 °C. 

Monoclonal antibodies (mAb) are prepared by 
immunizing inbred mice, preferably Balb/c, with the 
immunogen. The mice are immunized by the IP or SC route 
with about 0.001 mg to about 1.0 mg, preferably about 0.1 
mg, of the immunogen in about 0 . 1 ml buffer or saline 
incorporated in an equal volume of an acceptable 
adjuvant, as discussed above. Freund's adjuvant is 
preferred, with Freund's complete adjuvant being used for 
the initial immunization and Freund's incomplete adjuvant 
used thereafter. The mice receive an initial 
immunization on day 0 and are rested for about 2 to about 
30 weeks. Immunized mice are given one or more booster 
immunizations of about 0.001 to about 1.0 mg of protease 
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PRSS11-L in a buffer solution such as phosphate buffered 
saline by the intravenous (IV) route. Lymphocytes, from 
antibody positive mice, preferably spleenic lymphocytes, 
are obtained by removing spleens from immunized mice by 
5 standard procedures known in the art. Hybridoma cells 
are produced by mixing the splenic lymphocytes with an 
appropriate fusion partner, preferably myeloma cells, 
under conditions that will allow the formation of stable 
hybridomas. Fusion partners may include, but are not 

10 limited to: mouse myelomas P3/NSl/Ag 4-1; MPC-11; S-194 
and Sp2/0, with Sp2/0 being generally preferred. The 
antibody producing cells and myeloma cells are fused in 
polyethylene glycol, about 1000 mol . wt . , at 
concentrations from about 30% to about 50%. Fused 

15 hybridoma cells are selected by growth in hypoxanthine, 
thymidine and aminopterin supplemented Dulbecco's 
Modified Eagles Medium (DMEM) by procedures known in the 
art. Supernatant fluids are collected, from growth 
positive wells on about days 14, 18, and 21 and are 

20 screened for antibody production by an immunoassay such 
as solid phase immunoradioassay (SPIRA) using protease 
PRSS11-L as the antigen. The culture fluids are also 
tested in the Ouchterlony precipitation assay to 
determine the isotype of the mAb. Hybridoma cells from 

25 antibody positive wells are cloned by a technique such as 
the soft agar technique of MacPherson, Soft Agar 
Techniques, in Tissue Culture Methods and Applications, 
Kruse and Paterson, Eds., Academic Press, 1973 or by the 
technique of limited dilution. 
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Monoclonal antibodies are produced in vivo by 
injection of pristane primed Balb/c mice, approximately 
0.5 ml per mouse, with about 1 x 10 6 to about 6 x 10 6 
hybridoma cells at least about 4 days after priming. 
Ascites fluid is collected at approximately 8-12 days 
after cell transfer and the monoclonal antibodies are 
purified by techniques known in the art. 

Monoclonal Ab can also be produced in vitro by 
growing the hydridoma in tissue culture media well known 
in the art. High density in vitro cell culture can be 
conducted to produce large quantities of mAbs using 
hollow fiber culture techniques, air lift reactors, 
roller bottle, or spinner flasks culture techniques well 
known in the art. The mAb are purified by techniques 
known in the art . 

Antibody titers of ascites or hybridoma culture 
fluids are determined by various serological or 
immunological assays which include, but are not limited 
to, precipitation, passive agglutination, enzyme-linked 
immunosorbent antibody (ELISA) technique and 
radioimmunoassay (RIA) techniques. Similar assays are 
used to detect the presence of protease PRSS11-L in body 
fluids or tissue and cell extracts. 

The antibody molecules can be isolated from the mammal 
(e.g., from the blood) or culture cells and further purified 
by well-known techniques, such as protein A chromatography 
to obtain the IgG fraction. Alternatively, antibodies 
specific for a protein or polypeptide of the invention can 
be selected for (e.g., partially purified) or purified by, 
e.g., affinity chromatography. For example, a recombinantly 
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expressed and purified (or partially purified) protein of 
the invention is produced as described herein, and 
covalently or non-covalently coupled to a solid support such 
as, for example, a chromatography column. The column can 
then be used to affinity purify antibodies specific for the 
proteins of the invention from a sample containing 
antibodies directed against a large number of different 
epitopes, thereby generating a substantially purified 
antibody composition, i.e., one that is substantially free 
of contaminating antibodies. By a substantially purified 
antibody composition is meant, in this context, that the 
antibody sample contains at most only 3 0% (by dry weight) of 
contaminating antibodies directed against epitopes other 
than those on the desired protein or polypeptide of the 
invention, and preferably at most 20%, yet more preferably 
at most 10%, and most preferably at most 5% (by dry weight) 
of the sample is contaminating antibodies. A purified 
antibody composition means that at least 99% of the 
antibodies in the composition are directed against the 
desired protein or polypeptide of the invention. 

Additionally, recombinant antibodies, such as chimeric 
and humanized monoclonal antibodies, comprising both human 
and non-human portions, which can be made using standard 
recombinant DNA techniques, are within the scope of the 
invention. A chimeric antibody is a molecule in which 
different portions are derived from different animal 
species, such as those having a variable region derived from 
a murine mAb -and a human immunoglobulin constant region 
(See, e.g., Cabilly et al., U.S. Patent No. 4,816,567). 
Humanized antibodies are antibody molecules from non- human 



ORT-1644CIP 



46 



species having one or more complementarily determining 
regions (CDRs) from the non-human species and a framework 
region from a human immunoglobulin molecule (See, e.g., 
Queen, U.S. Patent No. 5,585,089). Such chimeric and 
humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example 
using methods described in PCT Publication No. WO 87/02671, 
which is incorporated herein by reference in its entirety.. 

Completely human antibodies are particularly desirable 
for therapeutic treatment of human patients. Such 
antibodies can be produced, for example, using transgenic 
mice which are incapable of expressing endogenous 
immunoglobulin heavy and light chains genes, but which can 
express human heavy and light chain genes. The transgenic 
mice are immunized in the normal fashion with a selected 
antigen, e.g., all or a portion of a polypeptide of the 
invention. Monoclonal antibodies directed against the 
antigen can be obtained using conventional hybridoma 
technology. The human immunoglobulin transgenes harbored by 
the transgenic mice rearrange during B cell differentiation, 
and subsequently undergo class switching and somatic 
mutation. Thus, using such a technique, it is possible to 
produce therapeutically useful IgG, IgA and IgE antibodies. 
For an overview of this technology for producing human 
antibodies, see Lonberg et al . , (1995), Int. Rev. Immunol. 
13 :65-93) . 

Completely human antibodies which recognize a selected 
epitope can be generated using a technique referred to as 
"guided selection" . In this approach a selected non-human 
monoclonal antibody, e.g., a mouse antibody, is used to 
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guide the selection of a completely human antibody 
recognizing the same epitope (Jespers et al ., (1994), 
Biol technology 12:899-903) . 

The antibody of the invention (e.g., monoclonal 
5 antibody) can be used to isolate the polypeptide of the 
invention by standard techniques, such as affinity 
chromatography or immunoprecipitation. Moreover, such an 
antibody can be used to detect the protein (e.g., in a 
cellular lysate or cell supernatant) in order to evaluate 

10 the abundance and pattern of expression of the polypeptide. 
The antibodies can also be used diagnogtically to monitor 
protein levels in tissue as part of a clinical testing 
procedure, e.g., to, for example, determine the efficacy of 
a given treatment regimen. Detection can be facilitated by 

15 coupling the antibody to a detectable substance. Examples 
of detectable substances include various enzymes, prosthetic 
groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. 
Examples of suitable enzymes include horseradish peroxidase, 

20 alkaline phosphatase, beta-galactosidase, or 

acetyleholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; 
examples of suitable fluorescent materials include 
umbellif erone, fluorescein, fluorescein isothiocyanate, 

25 rhodamine, dichlorotriazinylamine fluorescein, dansyl 
chloride or phycoerythrin; an example of a luminescent 
material includes luminol; examples of bioluminescent 
materials include lucif erase, luciferin, and aequorin, and 
examples of suitable radioactive material include 125 I, 131 I, 

30 35 S or 3 H. Further, an antibody (or fragment thereof) can be 



ORT-1644CIP 



48 



conjugated to a therapeutic moiety such as a cytotoxin, a 
therapeutic agent or a radioactive metal ion. A cytotoxin or 
cytotoxic agent includes any agent that is detrimental to 
cells. Examples include taxol, cytochalasin B, gramicidin D, 
ethidium bromide, emetine, mitomycin, etoposide, tenoposide, 
vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracin dione, mitoxantrone, 
mithramycin, actinomycin D, I - dehydrotestosterone , 
glucocorticoids, procaine, tetracaine, lidocaine, 
propranolol, and puromycin and analogs or homologs thereof. 
Therapeutic agents include, but are not limited to, 
antimetabolites (e.g., methotrexate, 6-mercaptopurine , 6- 
thioguanine, cytarabine, 5- fluorouracil decarbazine) , 
alkylating agents (e.g., mechlorethamine, thioepa 
chlorambucil, melphalan, carmustine (BSNU) and lomustine 
(CCN-LJ) , cyclothosphamide, busulfan, dibromomannitol, 
streptozotocin, mitomycin C, and cis-dichlorodiamine 
platinum (11) (DDP) cisplatin) , anthracyclines (e.g., 
daunorubicin (formerly daunomycin) and doxorabicin) , 
antibiotics (e.g., dactinomycin (formerly actinomycin), 
bleomycin, mithramycin, and anthramycin (AMC) ) , and anti- 
mitotic agents (e.g., vincristine and vinblastine). 

The conjugates of the invention can be used for 
modifying a given biological response, the drug moiety is 
not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety can be a 
protein or polynucleotide possessing a desired biological 
activity. Alternatively, an antibody can be conjugated to a 
second antibody to form an antibody heteroconjugate as 
described by Segal in U.S. Patent No. 4,676, 980. 
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Methods of Detection 

The present invention also relates to methods of 
detecting a polypeptide or nucleic acid of the invention in 
a biological sample. Such assay can be used for diagnostic 
purpose . 

In one embodiment, the invention provides a method of 
detecting a nucleic acid molecule of a PRSS11-L gene, 
comprising the step of contacting a biological sample with a 
nucleic acid probe that hybridizes to the PRSS11-L nucleic 
acid molecule under stringent conditions arid detecting the 
probe-nucleic acid molecule complex. A preferred agent for 
detecting mRNA or genomic DNA of a PRSS11-L gene is a 
labeled nucleic acid probe capable of hybridizing to mRNA or 
genomic DNA encoding a polypeptide of the invention as 
described supra. 

In another embodiment, the invention provides a method 
of detecting a PRSS11-L protein, comprising the step of 
contacting a biological sample with an antibody that 
selectively binds to amino acids 1 to 9 of SEQ ID NO: 2, and 
detecting the protein-antibody complex. A preferred agent 
for detecting a polypeptide of the invention is an labeled 
antibody capable of binding to amino acids 1 to 9 of SEQ ID 
No: 2 as described supra. 

Proteins, mRNAs, and genomic DNAs, can be detected in 
vitro as well as in vivo. For example, in vitro techniques 
for detection of mRNA include Northern hybridization, in 
situ hybridization, and RT-PCR. In vitro techniques for 
detection of a polypeptide of the invention include enzyme 
linked immunosorbent assays (ELISAs) , Western blots, 
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immunoprecipitations and immunofluorescence. In vitro 
techniques for detection of genomic DNA include Southern 
hybridization and PCR. Furthermore, in vivo techniques for 
detection of mRNA include transcriptional fusion infra. In 
5 vivo techniques for detection of a polypeptide of the 

invention include translational fusion infra, or introducing 
into a subject a labeled antibody directed against the 
polypeptide. For example, the antibody can be labeled with 
a radioactive marker whose presence and location in a 

10 subject can be detected by standard imaging techniques. 

The invention also encompasses kits for detecting 
the presence of a polypeptide or nucleic acid of the 
invention in a biological sample (a test sample) . Such 
kits can be used to determine if a subject is suffering 

15 from or is at increased risk of developing a disorder 
related to PRSS11-L. Such a kit preferably comprises a 
compartmentalized carrier suitable to hold in close 
confinement at least one container. The carrier further 
comprises reagents such as recombinant PRSS11-L protein 

20 or anti-PRSSll-L antibodies suitable for detecting 

protease PRSS11-L. The carrier can also contain a means 
for detection such as labeled antigen or enzyme 
substrates or the like. For example, the kit can 
comprise a labeled compound or agent capable of detecting 

25 the polypeptide or mRNA encoding the polypeptide in a 

biological sample and means for determining the amount of 
the polypeptide or mRNA in the sample (e.g., an antibody 
which binds the polypeptide or an oligonucleotide probe 
which binds to DNA or mRNA encoding the polypeptide) . 

30 The kits can also include instructions for determining 
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whether a test subject is suffering from or is at risk of 
developing a disorder associated with aberrant expression 
of the polypeptide if the amount of the polypeptide or 
mRNA encoding the polypeptide is above or below a normal 
5 level. 

For antibody-based kits, the kit can comprise, for 
example: (1) a first antibody (e.g., an antibody attached to 
a solid support) which binds to a polypeptide of the 
invention; and, optionally; (2) a second, different antibody 

10 which binds to either the polypeptide or the first antibody 
and is conjugated to a detectable agent; and (3) a purified 
recombinant PRSS11-L protein as positive control. 

For oligonucleotide-based kits, the kit can comprise, 
for example: (1) an oligonucleotide, e.g., a detectably 

15 labeled oligonucleotide, which hybridizes to a nucleic acid 
sequence encoding a polypeptide of the invention or (2) a 
pair of primers useful for amplifying a nucleic acid 
molecule encoding a polypeptide of the invention. The kit 
can also comprise, e.g., a buffering agent, a preservative, 

20 or a protein stabilizing agent. The kit can also comprise 
components necessary for detecting the detectable agent 
(e.g., an enzyme or a substrate). The kit can also contain 
a control sample or a series of control samples which can be 
assayed and compared to the test sample contained. Each 

25 component of the kit is usually enclosed within an 

individual container and all of the various containers are 
within a single package along with instructions for 
observing whether the tested subject is suffering from or is 
at risk of developing a disorder associated with aberrant 

30 expression of the polypeptide. 
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The invention also provides a method to detect 
genetic lesions or mutations in a PRSS11-L gene, thereby 
determining if a subject with the lesioned gene is at 
risk for a disorder related to PRSS11-L. In preferred 
embodiments, the methods comprises the steps of: (1) 
isolating PRSS11-L polynucleotide from a biological 
sample; (2) detecting a polynucleotide sequence of 
PRSS11-L in the sample; and (3) finding genetic lesions 
by comparing the sequence with that of the wild- type 
gene . 

Examples of genetic mutations or genetic lesions of 
interest include, but are not limited to detection of : 1) a 
deletion of one or more nucleotides from the gene; 2) an 
addition of one or more nucleotides to the gene; 3) a 
substitution of one or more nucleotides of the gene; 4) a 
chromosomal rearrangement of the gene; 5) an alteration in 
the level of a messenger RNA transcript of the gene; 6) an 
aberrant modification of the gene, such as of the 
methylation pattern of the genomic DNA; 7) the presence of a 
non-wild type splicing pattern of a messenger RNA transcript 
of the gene; 8) a non-wild type level of a the protein 
encoded by the gene; 9) an allelic loss of the gene; and 10) 
an inappropriate post-translational modification of the 
protein encoded by the gene. Methods for detecting these 
genetic lesions or mutations are well known in the art. 
There are a large number of assay techniques known in the 
art which can be used for detecting lesions in a gene, such 
as PCR reactions, restriction enzyme cleavage patterns, 
hybridizing a sample and control nucleic acids, sequencing 
reactions, alterations in electrophoretic mobility, 
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selective oligonucleotide hybridization, selective 
amplification, and selective primer extension. 

Method of Treatment Using Gene Therapy 

The present invention provides methods of treating a 
subject at risk of or suffering from a disorder related to 
PRSS11-L by increasing or decreasing the expression of 
PRSS11-L using gene therapy. • 

In one embodiment, PRSS11-L antisense therapy can be 
used to decrease the expression of PRSS11-L in a cell. 
PRSS11-L antisense therapy can be particularly useful in 
decreasing PRSS11-L activity. 

The principle of antisense based strategies is based on 
the hypothesis that sequence-specific suppression of gene 
expression can be achieved by intracellular hybridization 
between mRNA and a complementary antisense species. The 
formation of a hybrid RNA duplex can then interfere with the 
processing/transport/translation and/or stability of the 
target PRSS11-L mRNA. Hybridization is required for the 
antisense effect to occur. Antisense strategies can use a 
variety of approaches including the use of antisense 
oligonucleotides, injection of antisense RNA and 
transfection of antisense RNA expression vectors. 
Phenotypic effects induced by antisense hybridization to a 
sense strand are based on changes in criteria such as 
protein levels, protein activity measurement, and target 
mRNA levels. 

An antisense nucleic acid can be complementary to an 
entire coding strand of a PRSS11-L gene, or to only a 
portion thereof. An antisense nucleic acid molecule can 
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also be complementary to all or part of a non-coding region 
of the coding strand of a PRSS11-L gene. The non-coding 
regions ("5' and 3 f untranslated regions") are the 5 ! and 3' 
sequences which flank the coding region and are not 
translated into amino acids. Preferably, the non-coding 
region is a regulatory region for the transcription or 
translation of the PRSS11-L gene. 

An antisense oligonucleotide can be, for example, about 
15, 25, 35, 45 or 65 nucleotides or more in length taken 
from the complementary sequence of SEQ ID N0:1. An 
antisense nucleic acid can be constructed using chemical 
synthesis and enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid 
(e.g., an antisense oligonucleotide) can be chemically 
synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the 
physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be 
used. Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5-f luorouracil, 
5- bromouracil, 5-chlorouracil, 5-iodouracil , hypoxanthine, 
xanthine , 4 - acetylcytosine , 5 - ( carboxyhydroxylmethyl ) 
uraci 1 , 5 - carboxy tnethylaminomet hyl - 2 - thiour idine , 5 - 
carboxymethylaminomethyluracil , dihydrouracil , beta-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, I- 
methylguanine, 1-methylinosine, 2 , 2-dimethylguanine, 2- 
methyladenine, 2- methylguanine , 3-methylecytosine, . 5- 
methylcytosine, N6- adenine, 7- methylguanine, 5- 
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methylaminomethyluracil , 5-methoxyaminomethyl-2-thiouracil, 
beta-D- mannosylqueosine, 5 1 -methoxycarboxymethyluracil , 5- 
methoxyuracil, 2- methyl thio-N6- isopentenyladenine , uracil - 
5-oxyacetic acid (v) , wybutoxosine, pseudouracil , queosine, 
5 2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4- 
thiouracil, 5- methyluracil , uracil-5- oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3- amino-3-N-2-carboxypropyl) uracil, 
(acp3 )w, and 2 , 6-diaminopurine . An antisense nucleic acid 

10 molecule can be a CC-anomeric nucleic acid molecule. A CC- 
anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary 
to the usual P-units, the strands run parallel to each other 
(Gaultier et al . (1987) Nucleic Acids Res. 15:6625-664 1). 

15 The antisense nucleic acid molecule can also comprise a 2'- 
o-methylribonucleotide (Inoue et al . (1987) Nucleic Acids 
Res. 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et 
al. (1987) FEBS Lett. 215:327-330). 

Alternatively, the antisense nucleic acid can also be 

20 produced biologically using an expression vector into which 
a nucleic acid has been subcloned in an antisense 
orientation as described supra. The antisense expression 
vector can be in the form of a recombinant plasmid, phagemid 
or attenuated virus in which antisense nucleic acids are 

25 produced under the control of a high efficiency regulatory 
region, the activity of which can be determined by the cell 
type into which the vector is introduced. To achieve 
sufficient intracellular concentrations of the antisense 
molecules, vector constructs in which the antisense nucleic 

30 acid molecule is placed under the control of a strong pol II 
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or pol III promoter are preferred. For a discussion of the 
regulation of gene expression using antisense genes see 
Weintraub et al. (1985, Trends in Genetics, Vol. 1(1), pp. 
22-25) . 

5 Typically, antisense nucleic acid is administered to a 

subject by microinjection, liposome encapsulation or 
generated in situ by expression from vectors harboring the 
antisense sequence. An example of a route of administration 
of antisense nucleic acid molecules includes direct 

10 injection at a tissue site. The antisense nucleic acid can 
be ligated into viral vectors that mediate transfer of the 
antisense nucleic acid when the viral vectors are introduced 
into host cells. Suitable viral vectors include retrovirus, 
adenovirus, adeno-associated virus, herpes virus, vaccinia 

15 virus, polio virus and the like. Alternatively, antisense 
nucleic acid molecules can be modified to target selected 
cells and then administered systemically . For example, for 
systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens 

20 expressed on a selected cell surface, e.g., by linking the 
antisense nucleic acid molecules to peptides or antibodies 
which bind to cell surface receptors or antigens. 

Once inside the cell, antisense nucleic acid molecules 
hybridize with or bind to cellular mRNA and/or genomic DNA 

25 encoding a PRSS11-L protein to thereby inhibit expression, 
e.g., by inhibiting transcription and/or translation. The 
hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in 
the case of an antisense nucleic acid molecule which binds 

30 to DNA duplexes, through specific interactions in the major 
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groove of the double helix. 

In a preferred embodiment, when it is beneficial to 
decrease PRSS11-L activity, the method of decreasing the 
expression of PRSS11-L in a subject in need thereof involves 
5 the use of small interfering RNA (siRNA) . 

In several organisms, introduction of double -stranded 
RNA has proven to be a powerful tool to suppress gene 
expression through a process known as RNA interference. 
Many organisms possess mechanisms to silence any gene when 

10 double- stranded RNA (dsRNA) corresponding to the gene is 

present in the cell. The technique of using dsRNA to reduce 
the activity of a specific gene was first developed using 
the worm C. elegans and has been termed RNA interference, or 
RNAi (Fire, et al . , (1998), Nature 391: 806-811). RNAi has 

15 since been found to be useful in many organisms, and 

recently has been extended to mammalian cells in culture 
(see review by Moss, (2001), Curr Biol 11: R772-5) . 

An important advance was made when RNAi was shown to 
involve the generation of small RNAs of 21-25 nucleotides 

20 (Hammond et al . , (2000) Nature 404: 293-6; Zamore et al . , 
(2000) Cell 101: 25-33). These small interfering RNAs, or 
siRNAs, can initially be derived from a larger dsRNA that 
begins the process, and are complementary to the target RNA 
that is eventually degraded. The siRNAs are themselves 

25 double -stranded with short overhangs at each end; they act 
as guide RNAs, directing a single cleavage of the target in 
the region of complementarity (Elbashir et al . , (2001) Genes 
Dev 15: 188-200) . 
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Methods of producing siRNA of generally 21 to 23 
nucleotides (nt) in length from an in vitro system and use 
of the siRNA to interfere with mRNA of a gene in a cell or 
organism are described in WO0175164 A2 . 
5 The siRNA can also be made in vivo from a mammalian 

cell using a stable expression system. A new vector system, 
named pSUPER, that directs the synthesis of small 
interfering RNAs (siRNAs) in mammalian cells, was recently 
reported (Brummelkamp et al . , (2002) Science 296: 550-3.). 

10 To produce pSUPER, the Hl-RNA promoter was cloned in 

front of the gene specific targeting sequence (19-nt 
sequences from the target transcript separated by a short 
spacer from the reverse complement of the same sequence) and 
five thymidines (T5) as a termination signal. The resulting 

15 transcript is predicted to fold back on itself to form a 19- 
base pair stem- loop structure, resembling that of C. elegans 
Let-7. The size of the loop (the short spacer) is 
preferably 9 bp. A small RNA transcript lacking a poly- 
adenosine tail, with a well-defined start of transcription 

20 and a termination signal consisting of five thymidines in a 
row (T5) was produced. Most importantly, the cleavage of 
the transcript at the termination site is after the second 
uridine yielding a transcript resembling the ends of 
synthetic siRNAs, that also contain two 3' overhanging T or 

25 U nucleotides. The siRNA expressed from pSUPER is able to 
knock down gene expression as efficiently as the synthetic 
siRNA. 

The present invention provides a method of decreasing 
the expression of PRSS11-L in a cell of a subject in need 
30 thereof, comprising the steps of (a) introducing siRNA that 
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vaccinia virus, bovine papilloma virus, and herpes virus 
such as Epstein- Barr virus. Gene transfer could also be 
achieved using non-viral means requiring infection in vitro. 
This would include calcium phosphate, DEAE dextran, 
electroporation, and protoplast fusion. Targeted liposomes 
can also be potentially beneficial for delivery of DNA into 
a cell. 

For example, a DNA molecule encoding the PRSS11-L 
protein, is first cloned into a retroviral vector. The 
expression of PRSS11-L protein from the vector is driven 
from its endogenous promoter or from the retroviral long 
terminal repeat or from a promoter specific for certain 
target cells. The vector is then introduced into a cell of 
a subject to successfully express PRSS11-L proteins in the 
target cells. The gene is preferably delivered to those 
cells in a form which can be used by the cell to encode 
sufficient protein to provide effective function. 
Retroviral vectors are often a preferred gene delivery 
vector for gene therapy especially because of their high 
efficiency of infection and stable integration and 
expression. Alternatively, PRSS11-L DNA can be transferred 
into cells for gene therapy by non-viral techniques 
including receptor-mediated targeted DNA transfer using 
ligand-DNA conjugates or adenovirus -ligand-DNA conjugates, 
lipofection membrane fusion or direct microinjection. These 
procedures and variations thereof are suitable for ex vivo 
as well as in vivo PRSS11-L gene therapy. Protocols for 
molecular methodology of gene therapy suitable for use with 
the PRSS11-L gene are described in Gene Therapy Protocols, 
edited by Paul D. Robbins, Human press, Totowa NJ, 1996. 
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Methods of Identifying Modulators of PRS11-L 

"Inhibitors", "activators", and "modulators" of PRSS11- 
L refer to inhibitory or activating molecules identified 
using in vitro and in vivo binding assays for PRSS11-L. 
Preferably by measuring the serine protease activity of 
PRSS11-L, the binding affinity of PRSS11-L to other 
proteins, or the chaperon activity of PRSS11-L. 

In particular, "inhibitors" refer to compounds that 
decrease, prevent, inactivate, desensitize or down regulate 
PRSS11-L expression or activity. "Activators" are compounds 
that increase, activate, facilitate, sensitize or up 
regulate PRSS11-L expression or activity. "Modulators" 
include both the "inhibitors" and "activators". 

The compound identification methods can be performed 
using conventional laboratory formats or in assays adapted 
for high throughput. The term "high throughput" refers to 
an assay design that allows easy screening of multiple 
samples simultaneously, and can include the capacity for 
robotic manipulation. Another desired feature of high 
throughput assays is an assay design that is optimized to 
reduce reagent usage, or minimize the number of 
manipulations in order to achieve the analysis desired. 
Examples of assay formats include 96 -well or 384 -well 
plates, levitating droplets, and "lab on a chip" 
microchannel chips used for liquid handling experiments. It 
is well known by those in the art that as miniaturization of 
plastic molds and liquid handling devices are advanced, or 
as improved assay devices are designed, that greater numbers 
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of samples can be performed using the design of the present 
invention. 

Candidate compounds encompass numerous chemical 
classes, although typically they are organic compounds. 
5 Preferably, they are small organic compounds, i.e., those 
having a molecular weight of more than 5 0 yet less than 
about 2500. Candidate compounds comprise functional 
chemical groups necessary for structural interactions with 
polypeptides, and typically include at least an amine, 

10 carbonyl, hydroxyl or carboxyl group, preferably at least 

two of the functional chemical groups and more preferably at 
least three of the functional chemical groups. The 
candidate compounds can comprise cyclic carbon or 
heterocyclic structure and/or aromatic or polyaromatic 

15 structures substituted with one or more of the above- 
identified functional groups. Candidate compounds also can 
be biomolecules such as peptides, saccharides, fatty acids, 
sterols, isoprenoids, purines, pyrimidines, derivatives* or 
structural analogs of the above, or combinations thereof and 

20 the like. Where the compound is a nucleic acid, the compound 
typically is a DNA or RNA molecule, although modified 
nucleic acids having non-natural bonds or subunits are also 
contemplated. 

Candidate compounds are obtained from a wide variety of 
25 sources including libraries of synthetic or natural 

compounds. For example, numerous means are available for 
random and directed synthesis of a wide variety of organic 
compounds and biomolecules, including expression of 
randomized oligonucleotides, synthetic organic combinatorial 
30 libraries, phage display libraries of random peptides, and 
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preparation of combinatorial libraries can readily prepare 
such libraries based on the existing serine protease 
activators/inhibitors (see Abato et al., (1999), J Med 
Chem., 42:4001-9; Zega et al.,(2001), Bioorg Med Chem, 
5 9:2745-56) . 

A variety of other reagents also can be included in 
the mixture. These include reagents such as salts, 
buffers, neutral proteins (e.g., albumin), detergents, 
etc. that can be used to facilitate optimal protein- 

10 protein and/or protein-nucleic acid binding. Such a 
reagent can also reduce non-specific or background 
interactions of the reaction components. Other reagents 
that improve the efficiency of the assay such as nuclease 
inhibitors, antimicrobial agents, and the like can also 

15 be used. 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 
Zuckermann et al. (1994). J Med. Chem. 37:2678. Libraries 
of compounds can be presented in solution (e.g., Houghten 

20 (1992) Biotechniques 13:412-421), or on beads (Lam (1991) 

Nature 354 :82-84) , chips (Fodor (1993) Nature 364 : 555-556) , 
bacteria (U.S. Patent No. 5,223,409), spores (Patent NO. 
5,571,698), plasmids (Cull et al . (1992) Proc. Natl. Acad. 
Sci. USA 89:1865-18.69) or phage (see e.g., Scott and Smith 

25 (1990) Science 249:3 86-390). 

Methods of Identifying A Compound That Modulates PRSS11-L 
Expression 

As used herein, "a compound that modulates PRSS11-L 
expression'' includes compounds that increase or decrease 
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PRSS11-L gene transcription and/or translation. The 
invention provides a method of identifying such a compound, 
which comprises the steps of contacting a compound with a 
regulatory sequence of the PRSS11-L pacemaker gene or a 
5 cellular component that binds to the regulatory sequence; 

and determining the effect of the compound on the expression 
of a gene controlled by the regulatory sequence; wherein the 
regulatory sequence of the PRSS11-L gene is either within a 
host cell or in a cell-free system. 

10 In a preferred embodiment, the method involves 

detecting activity of a regulatory sequence of the 
PRSS11-L gene in a target cell. The cell-based assay 
comprises the step of: (1) contacting a compound with a 
cell comprising a regulatory sequence for a PRSS11-L gene 

15 or a cellular component that binds to the regulatory 

sequence operably linked to PRSS11-L or a reporter gene; 
(2) measuring the effect of the compound on the 
expression of the PRSS11-L or the reporter gene; and (3) 
comparing the effect of the compound with that of a 

20 reference control. The target cell can be a native 
PRSS11-L containing cell cell, or a cell where the 
PRSS11-L gene has been introduced. The reference control 
preferably contain the vehicle in which the testing 
compound is dissolved. Several assay methods can be used 

25 to measure the effect of the compound on the expression 
of the PRSS11-L or reporter gene inside a cell. For 
example, gene or protein fusions comprising the 
regulatory sequence for a PRSS11-L coding sequence linked 
to a reporter gene can be used. 

30 As used herein, "a reporter gene" refers to a gene 
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whose gene product can be measured using conventional lab 
techniques. Such reporter genes include, but are not 
limited to, genes encoding green fluorescent protein 
(GPP) , p-galactosidase, lucif erase, chloramphenicol 
5 acetyl transferase, P -glucuronidase, neomycin 

phosphotransferase, and guanine xanthine phosphor ibosyl- 
transf erase . 

The gene fusion is constructed such that only the 
transcription of the reporter gene is under control of 

10 the PRSS11-L regulatory sequence. The protein fusion is 
constructed so that both the transcription and 
translation of the reporter gene protein are under 
control of the PRSS11-L regulatory sequence. Preferably, 
a second gene or protein fusion comprising the same 

15 reporter but a different regulatory sequence (i.e., a 
regulatory sequence for a gene unrelated to PRSS11-L 
protease) can be used to increase the specificity of the 
assay. The effect of the compound on the expression of 
the reporter gene, such as GFP, can be measured by 

20 methods known to those skilled in the art. For example, 
the effect of the compound on expression of GFP can be 
measured as the effect of the compound on emissions of 
green fluorescence from the cell using a fluorometer. 
Alternatively, a cellular phenotype attributed to PRSS11- 

25 L protease, such as the protease activity of PRSS11-L, 

can also be used to measure the effect of the compound on 
the expression of the PRSS11-L protein. In addition, the 
effect of the compound can be assayed by measuring the 
amount of PRSS11-L mRNA or protein inside the cell 
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directly using methods described infra (i.e., Northern 
Blot, RT-PCR, SDS-PAGE, Western Blot, etc) . 

Note that the cell -based method described herein not 
only identifies compounds that regulate PRSS11-L, 
5 expression directly via binding to the regulatory 
sequence of the PRSS11-L gene, but also identifies 
compounds that regulate PRSS11-L expression indirectly 
via binding to other cellular components whose activities 
influence PRSS11-L expression. For example, compounds 
10 that modulate the activity of a transcriptional activator 
or inhibitor for PRSS11-L genes can be identified using 
the method described herein. 

In another embodiment, the method involves a 
regulatory sequence of the PRSS11-L gene in a cell-free 
15 assay system. The cell-free assay comprises the step of: 

(1) contacting a compound with a regulatory sequence for 
a PRSS11-L gene or with a cellular component that binds 
to the regulatory sequence in a cell -free assay system; 

(2) measuring the effect of the compound on the 

20 expression of the PRSS11-L or reporter gene controlled by 
the regulatory sequence; and (3) comparing the effect of 
the compound with that of a reference control. The 
reference control contains only the vehicle in which the 
testing compound is dissolved. Examples of a cell-free 

25 assay system include in vitro translation and/or 

transcription systems, which are known to those skilled 
in the art. For example, the full length PRSS11-L cDNA, 
including the regulatory sequence, can be cloned into a 
plasmid. Then, using this construct as the template, 

30 PRSS11-L protein can be produced in an in vitro 
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transcription and translation system. Alternatively , 
synthetic PRSS11-L mRNA or mRNA isolated from PRSS11-L 
protein producing cells can be efficiently translated in 
various cell -free systems, including but not limited to 
5 wheat germ extracts and reticulocyte extracts. The 

effect of the compound on the expression of the PRSS11-L* 
or reporter genes controlled by the regulatory sequence 
can be monitored by direct measurement of the quantity of 
HCN or reporter mRNA or protein using methods described 
10 infra. 

Methods of Identifying A Compound That Modulates PRSS11-L 
Biological Activity 

As used herein, "a compound that modulates PRSS11-L 
biological activity" includes compounds that increase or 

15 decrease one or more biological activities of PRSS11-L. 

Such compounds include, but are not limited to, compounds: 
1) that increase or decrease the serine protease activity of 
PRSS11-L; 2) that increase or decrease the protein stability 
of the PRSS11-L; 3) that increase or decrease the chaperon 

20 activity of PRSS11-L; or 4) that increase or decrease the 

binding affinity of PRSS11-L to other proteins. The present 
invention provides methods of identifying such a compound, 
comprising the steps of contacting a test compound with a 
PRSS11-L protein; and determining the effect of the compound 

25 on the biological activities of the PRSS11-L protein. 

In one preferred embodiment, compounds that increase 
or decrease the protease activity of PRSS11-L can be 
identified by a method comprising the steps of: (1) 
contacting a test compound with a PRSS11-L protein and 

30 with a substrate that is cleavable by the PRSS11-L 
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protease; and (2) determining whether the test compound 
increases or decreases the cleavage of said substrate by 
the PRSS11-L protease. While various appropriate 
substrates can be designed for use in the assay, such as 
case casein, preferably, a labeled peptidyl substrate 
comprising the PRSS11-L protease cleavage site, is used. 
Labeled substrates include, but are not limited to; 
substrate that is radiolabeled (Coolican et al., (1986), 
J". Biol. Chem. 261:4170-6), fluorometric (see Lonergan et 
al., (1995), J. Food Sci . 60:72-3, 78) or colorimetric 
(Buroker-Kilgore et al.,(1993), Anal. Biochem. 208:387- 
92) . Radioisotopes useful for use in the present 
invention include those well known in the art, 
specifically 125 I, 131 l, 3 H , 14 C, 35 S, 32 P, and 33 P. 
Radioisotopes are introduced into the peptide by 
conventional means, such as iodination of a tyrosine 
residue, phosphorylation of a serine or threonine 
residue, or incorporation of tritium, carbon or sulfur 
using radioactive amino acid precursors. Zymography 
following SDS polyacrylamide gel electrophoresis 
(Wadstroem et al . , . (1973), Sci. Tools 20:17-21), as well 
as by fluorescent resonance energy transfer (FRET) -based 
methods (Ng et al . , (1989), Anal. Biochem. 183:50-6) are 
also methods used to detect compounds that modulate 
protease PRSS11-L proteolytic activity. A variety of 
methods can be used to detect the label, depending on the 
nature of the label and other assay components. For 
example, the label can be detected while bound to the 
solid substrate or subsequent to separation from the 
solid substrate. Labels can be directly detected through 
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optical or electron density, radioactive emissions, 
nonradiative energy transfers, etc. or indirectly 
detected with antibody conjugates, streptavidin-biotin 
conjugates, etc. Methods for detecting the labels are 
well known in the art. 

Compounds that are agonists will increase the rate of 
substrate degradation and will result in less remaining 
substrate as a function of time. Compounds that are 
antagonists will decrease the rate of substrate degradation 
and will result in greater remaining substrate as a function 
of time. To examine the extent of inhibition or activation, 
samples or assays comprising a PRSS11-L protein and its 
substrate are treated with a potential activator or 
inhibitor compound and are compared to control samples 
without the test compound. Control samples (untreated with 
test compounds) are assigned a relative protease activity 
value of 100%. Inhibition of PRSS11-L protease activity is 
achieved when the protease activity value relative to the 
control is about 75%, preferably 50%, more preferably 25-0%. 
Activation of PRSS11-L protease activity is achieved when 
the HCN activity value relative to the control is 110%, more 
preferably 150%, most preferably at least 200-500% higher or 
1000% or higher. 

The measurement means of the method of the present 
invention can be further defined by comparing two cells, one 
containing a PRSS11-L protein and a second cell originating 
from the same clone but lacking the PRSS11-L protein. After 
both cells are contacted with the same test compound, 
differences in PRSS11-L protease activities between the two 
cells are compared. Similarly, the measurement means of the 
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method of the present invention can be further defined by 
comparing two cell lysates derived from the above two cells. 
These techniques are also useful in establishing the 
background noise of these assays. One of ordinary skill in 
the art will appreciate that these control mechanisms also 
allow easy selection of cellular changes that are responsive 
to modulation of PRSS11-L protease activity. 

The term "cell" refers to at least one cell, but 
includes a plurality of cells appropriate for the 
sensitivity of the detection method. Cells suitable for the 
present invention can be bacterial, yeast, or eukaryotic. 

The term "cell lysate" refers to a collected of 
cellular components produced by the destructive process of 
lysing a cell or cells. 

The present invention provides methods to screen for 
proteins that interact with PRSS11-L. PRSS11-L interacting 
proteins could represent potential substrates or more likely 
modulators of PRSS11-L function and would likely yield clues 
as to the overall function of the PRSS11-L gene product. 
Methods to assay for protein-protein interactions are known 
to those skilled in the art. For example, the yeast two- 
hybrid system provides methods for detecting the interaction 
between a first test protein and a second test protein, in 
vivo, using reconst itution of the activity of a 
transcriptional activator. One method is disclosed in U.S. 
Patent No. 5,283,173; reagents are available from Clontech 
and Stratagene (San Diego, California) . Another alternative 
method is immunoaf f inity purification. Recombinant PRSS11-L 
is incubated with labeled or unlabeled cell extracts and 
immunoprecipitated with anti-PRSSll-L antibodies. The 



ORT-1644CIP 



immunoprecipitate is recovered with protein A-Sepharose and 
analyzed by SDS-PAGE. Unlabelled proteins are labeled by 
biotinylation and detected on SDS gels with streptavidin . 
Binding partner proteins are analyzed by microsequencing . 
5 Further, standard biochemical purification steps known to 
those skilled in the art can be used prior to 
microsequencing. Yet another alternative method is 
screening of peptide libraries for binding partners. 
Recombinant tagged or labeled PRSS11-L is used to select 

10 peptides from a peptide or phosphopeptide library which 

interact with PRSS11-L. Sequencing of the peptides leads to 
identification of consensus peptide sequences which might be 
found in interacting proteins. 

In another preferred embodiment, binding assays can be 

15 used to identify a compound that binds to PRSS11-L protein, 
and potentially is capable of increasing or decreasing the 
biological activity of PRSS11-L protein. One exemplary 
method comprises the steps of: (a) incubating a test 
compound with a PRSS11-L protein and a labeled ligand for 

20 the PRSS11-L protein; (b) separating the PRSS11-L protein 

from unbound labeled ligand; and (c) identifying a compound 
that inhibits ligand binding to the subunit by a reduction 
in the amount of labeled ligand binding to the PRSS11-L. An 
example of the labeled ligand for PRSS11-L protein is a 

25 labeled PRSS11-L specific antibody as described supra. 
Preferably, a PRSS11-L host cell (recombinant or native) 
that expresses the PRSS11-L can be used for the binding 
assay. More preferably, cell lysates prepared from the 
PRSS11-L host cell can be used for the binding assay. 

30 Further preferably, a substantially purified PRSS11-L 
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protein can be used for the binding assay. 

Separation of the PRSS11-L protein from unbound labeled 
ligand can be accomplished in a variety of ways. 
Conveniently, at least one of the. components is immobilized 
on a solid substrate, from which the unbound components can 
be easily separated. The solid substrate can be made of a 
wide variety of materials and in a wide variety of shapes, 
e.g., microtiter plate, microbead, dipstick, resin particle, 
etc. The substrate preferably is chosen to maximize signal 
to noise ratios, primarily to minimize background binding, 
as well as for ease of separation and cost. 

Separation can be effected for example, by removing a 
bead or dipstick from a reservoir, emptying or diluting a 
reservoir such as a microtiter plate well, or rinsing a 
bead, particle, chromatographic column or filter with a wash 
solution or solvent. The separation step preferably includes 
multiple rinses or washes. For example, when the solid 
substrate is a microtiter plate, the wells can be washed 
several times with a washing solution that typically 
includes those components of the incubation mixture that do 
not participate in specific binding such as salts, buffer, 
detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads can be washed one or 
more times with a washing solution and isolated using a 
magnet . 

A wide variety of labels can be used to label the 
PRSS11-L ligand, such as those that provide direct detection 
(e.g., radioactivity, luminescence, optical or electron 
density, etc), or indirect detection (e.g., epitope tag such 
as the FLAG epitope, enzyme tag such as horseradish 
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peroxidase, etc.) . 

In more than one embodiment of the above assay methods 
of the present invention, it can be desirable to immobilize 
either the polypeptide of the invention or its target 
molecule to facilitate separation of complexed from 
uncomplexed forms of one or both of the proteins, as well as 
to accommodate automation of the assay. Binding of a test 
compound to the polypeptide, or interaction of the 
polypeptide with a target molecule in the presence and 
absence of a candidate compound, can be accomplished in any 
vessel suitable for containing the reactants. Examples of 
such vessels include microtitre plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein 
can be provided which adds a domain that allows one or both 
of the proteins to be bound to a matrix. For example, 
glutathione-S-transf erase fusion proteins or glutathione-S- 
transferase fusion proteins can be adsorbed onto glutathione 
sepharose beads (Sigma Chemical; St. Louis, MO) or 
glutathione derivatized microtitre plates, which are then 
combined with the test compound or the test compound and 
either the non-adsorbed target protein or A polypeptide of 
the invention, and the mixture incubated under conditions 
conducive to complex formation (e.g., at physiological 
conditions for salt and pH) . Following incubation, the 
beads or microtitre plate wells are washed to remove any 
unbound components and complex formation is measured either 
directly or indirectly, for example, as described above. 
Alternatively, the complexes can be dissociated from the 
matrix, and the level of binding or activity of the 
polypeptide of the invention can be determined using 
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Example 1 

Isolation of the Protease PRSS11-L cDNA 

All molecular biological methods were in accordance 
with those previously described (Maniatis et al . , (1989), 
1-1626) . Oligonucleotides were purchased from Ransom 
Hill Biosciences (Ransom Hill, CA) and all restriction 
endonucleases and other DNA modifying enzymes were from 
New England Biolabs (Beverly, MA) unless otherwise 
specified. All construct manipulations were confirmed by 
dye terminator cycle sequencing using Allied Biosystems 
377 fluorescent sequencers (Perkin Elmer, Foster City, 
CA) . 

A recombinant phage containing a partial protease 
PRSS11-L cDNA was isolated from a human prostate library 

(Clontech, Palo Alto, CA) . The insert, was subjected to 
sequence analysis and found to contain sequences encoding a 
novel S2 protease. Although this particular isolate was ~2- 
Kb, additional sequence was obtained by 5' -RACE (Frohman, 

(1991), Methods Enzymol . 218:340-362) using human prostate 
Marathon Ready cDNA (Clontech, Palo Alto, CA) and Native Pfu 
Polymerase (Stratagene, La Jolla, CA) in accordance with the' 
manufacturer's recommendations. Two sequential 50 yl 20 
cycle PCR reactions of 94 °C for 30 sec.; 60.0°C for 30 sec; 
72°C for 3.0 min. The first set of reactions were performed 
using the Marathon adapter primer API and SEQ ID NO. 3: 
PRSSll-Like Pl-L 5 9 - CAGCCGTGACCTTGAGCGTGTTG- 3 ' and the 
second set of reactions used 5.0 pi of the product from the 
first reaction as a template, the Marathon adapter primer 
API and SEQ ID NO. 4: PRSSll-Like P2-L 5'- 
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GGCCGAGTGACCCAGCAACAAC-3' . An approx . 1 . 4 -Kb DNA molecule 
was amplified from the second set PCR reaction as the 5' - 
RACE DNA. The full length PRSS11-L cDNA was obtained by 
combining the 5' -RACE DNA molecule with the original 2 kb 
5 isolate via a unique SacI restriction site. The nucleotide 
sequence of the reconstructed PRSS11-L cDNA was sequenced, 
and depicted in SEQ ID NO: 1. 

The isolated PRSS11-L cDNA was found to contain an open 
reading frame of 1002 nucleotides excluding the TGA stop 

10 codon. The open reading frame is likely to be authentic 
since it is preceded by an in- frame TGA stop codon at 
position 657. This clone is also likely to contain the 
entire 3' untranslated region since a putative 
polyadenylation sequence (AATAAA) was identified just 25 

15 nucleotides upstream from a 17 nucleotide poly A stretch. 

The deduced open reading frame encodes a PRSS11-L protein of 
334 amino acids (SEQ ID NO:2), with an estimated molecular 
mass (M r ) of about 36 -Kd. 

The complete sequence of the isolated cDNA as well as 

20 the encoded protein was not known previously. A detailed 
comparison of the encoded protein sequence with entries of 
the Genbank database revealed that the encoded protein is 
homologous to the catalytic domains of human S2 serine 
proteases. The sequence shares 62.1 % identity within a 

25 defined 327 nucleotide overlap to human S2 serine protease 
L56/PRSS11-L, and 53.3 % identity in a 323 nucleotide 
overlap to another human S2 serine protease Omi/HtrA2 . 
These human S2 serine proteases belong to a family of 
proteases that has striking conservation between eukaryotes 

30 and prokaryotes. Therefore, the isolated cDNA herein is 
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designated as PRSS11-L (PRSSll-Like) and its encoded protein 
as PRSS11-L protein. 

Sequence alignment revealed that PRSS11-L protein 
shared strong homology to the catalytic domains of the other 
two human S2 serine proteases (Fig.l) . Motifs shared by all 
three human S2 serine proteases are TNAHW, DIA and 
GNSGGPLVNLDGEVIG within the catalytic domains with the 
catalytic triad residues H, D and S of protease PRSS11-L 
located at positions 72, 108 and 186, respectively (using 
the methionine initiator of the PRSS11-L sequence as number 
one) (Fig.l). In addition the catalytic domains of these 
three S2 proteases appear to be flanked by an SH3 domain at 
the amino terminus and a PDZ domain at the C- terminus, both 
domains are importantly involved protein- interaction (Mayer, 

(2000) , J. Cell Science 114: 1253-1263; Sheng et al . , 

(2001) , Annu. Rev. Neurosci. 24:1-29). Therefore, PRSS11-L 
can interact with other proteins through these interaction 
interfaces. It is formally possible that the PRSS11-L 
protein is initially synthesized as an inactive zymogen 
precursor, which requires one or more limited proteolytic 
cleavages to become active. Because PRSS11-L protein 
appears to lack any hydrophobic amino acid stretch 
consistent with either a signal sequence or transmembrane 
domain, it is not likely to be secreted or an integral 
membrane protein. 

Other nucleotide sequences within sequence databases of 
Genbank (AY040094) and the dgene_na patent (AAA57359 
WO200039149-A2; AAA57361 WO200039149-A2 ; AAZ52362 
WO200021986-A2; AAS26920 WO200155441-A2 ; AAS26848 
WO200155441-A2) were subsequently found to share some 
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sequence identities with the PRSS11-L cDNA (SEQ ID NO:l). 
Sequence analysis confirmed that these nucleotide sequences 
yield distinct mRNAs , which encode proteins having divergent 
N-termini . 

Example 2 

Confirmation of an authentic PRSS11-L mRNA 

The existence of an authentic mRNA corresponding to the 
PRSS11-L (SEQ ID NO:l) in human prostate tissue was 
confirmed by PCR using primers designed to flank the 
translational initiation site. 

PCR was performed using primers designed to flank the 
translational initiation site. Specifically, primers SEQ ID 
NO: 7, 5' -GCAAGTCGGGCTGGGGTGTG-3' , and SEQ ID NO: 8, 5' - 
CAGGGAGCTTTTTCTTGGGATGGA- 3 ' were designed to produce a 962- 
bp fragment between positions 413 and 1374 of the PRSS11-L 
cDNA (SEQ ID NO:l) . For this application human prostate 
Marathon Ready cDNA was used as the templates and the GC 
cDNA amplification kit (Clontech, Palo Alto, CA) were used 
in accordance with the manufacturer's recommendations. Two 
sequential 50 p.1 PCR reactions, one of 30 cycles and the 
second of 10 cycles using 94 °C for 30 sec; 62.4°C for 30 
sec; 68°C for 2.0 min. were performed. The PCR product was 
subcloned using the pGEM Easy TA cloning system (Promega) 
and subjected to sequence analysis. The analysis revealed a 
962-bp fragment with 100 per cent identity to the isolated 
PRSS11-L cDNA from human prostate between positions 413 and 
1374 of SEQ ID NO:l. This result confirmed that indeed, an 
authentic mRNA, corresponding to the PRSS11-L (SEQ ID NO:!), 
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does actually exist in human prostate tissue. 

EXAMPLE 3 

Tissue Distribution of The Protease PRSS11-L mRNA 

The tissue distribution of the PRSS11-L mRNA was 
examined using commercially available human multiple-tissue 
Northern blots (Clontech, Palo Alto, CA) . The -1.4-Kb 
PRSSll-L 5' -RACE product supra was used as a hybridization 
probe for the blots. The 32 P-radiolabelled probe detected a 
single mRNA of an approximately 3.0 kb, which is consistent 
with the size of the reconstructed PRSSll-L cDNA (SEQ ID 
NO:l) . The PRSSll-L mRNA transcript is widely expressed in 
several tissues throughout the body. It is highly expressed 
in heart, ovary; moderately in small intestine, colon, 
stomach, thyroid, trachea, adrenal gland; weakly in skeletal 
muscle, placenta, lung, pancreas, thymus, prostate, testis 
and bone marrow; very weakly in liver, kidney,, spleen, 
spinal cord and lymph node. The PRSSll-L protease mRNA does 
not appear to be expressed in human brain although this does 
not exclude the possibility of a low level of PRSSll-L 
expression in smaller subregions of the brain. 

In comparison, Northern analysis of L56 expression in 
human tissues indicated that an approximate 2.3 kb 
transcript of L56 is strongly expressed in placenta, 
moderately in brain, liver and kidney, and weakly in lung, 
skeletal muscle, heart and pancreas (Zumbrunn et al . , 
(1996) , supra) . In further comparison, Northern analysis of 
Omi expression in human tissues indicated that two distinct 
mRNA species of Omi, a major one of approximately 2 . 1 kb and 
a minor one of approximately 4.5 kb, are expressed strongly 
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in placenta and pancreas and weakly in heart, brain, lung, 
liver, muscle and kidney (Faccio et al . , (2000), supra) . 

EXAMPLE 3 

5 Vectors For The Expression of Protease PRSS11-L 

The protease PRSS11-L expression vectors were 
constructed using the baculovirus expression vector 
pFastBacl (Life Technologies, Gaithersberg, MD) as 
described below. 

10 The purified plasmid DNA containing the full-length 

protease PRSS11-L cDNA was used as a template in a 100 fil 
preparative PCR reaction using the Native Pfu Polymerase 
(Stratagene, La Jolla, CA) in accordance with the 
manufacturer's recommendations. Two primers, SEQ ID NO 

15 5: PRSS11-L Xba-U 5 ' - CGTGTCTAGAGCCATGCACCTGGCCCTTCCCGCC - 
3' and SEQ ID NO 6: PRSS11-L Xba-L 5' - 

GCGCTCTAGACATGACCACCTCAGGTGCGA- 3 ' , which contained Xba I 
cleavable sites were used for the PCR reaction. The 
preparative PCR reaction was run for 18 cycles of 94 °C 

20 for 3 0 sec; 62.2°C for 3 0 sec; and 72°C for 2.0 min. The 
PRSS11-L coding sequence (CDS) Xba DNA cassette, 
comprising the full-length coding sequence of PRSS11-L 
(from nucleotide 1011 to 2014 of SEQ ID NO:l) flanked by 
two Xbal cleavage sites, was amplified from the PCR 

25 reaction. It was phenol/CHCl 3 (1:1) extracted once, CHC1 3 
extracted, and then EtOH precipitated with glycogen 
carrier (Boehringer Mannheim Corp., Indianapolis, IN). 
The precipitated pellet was rinsed with 70 % EtOH, dried 
by vacuum, and resuspended in 80 pi H 2 0, 10 pi 10 
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restriction buffer number 2 and 1 \xl lOOx BSA (New 
England Biolabs, Beverly, MA) . 

The isolated PRSS11-L Xba DNA cassette was digested 
for 3 hr. at 37°C with 200 units Xba I restriction enzyme 
(New England Biolabs, Beverly, MA) . The Xba I digested 
product was phenol/CHCl 3 (1:1) extracted once, CHC1 3 
extracted, EtOH precipitated, rinsed with 70 % EtOH, and 
dried by vacuum. For purification from contaminating 
template plasmid DNA, the product was electrophoresed 
through 1.0 % low melting temperature agarose (Life 
Technologies, Gaithersberg, MD) gels in TAE buffer (40 mM 
Tris-Acetate, 1 mM EDTA pH 8.3) and excised from the gel. 

The construct, PRSSll-Like CDS-HAHISpFastBac, 
containing a HA6HIS epitope/af f inity tag fused in frame 
to the C-termus of the PRSS11-L CDS, was obtained by in- 
gel ligations of an aliquot of the excised PRSS11-L CDS 
Xba I DNA cassette together with an Xba I digested, 
dephosphorylated and gel purified, modified pFastBacl 
baculovirus transplacement vector. The DNA sequence of 
the construct was confirmed by sequence analyses. 

The PRSSll-Like CDS Xba I fragment was also 
subcloned into a modified pCINeo mammalian expression 
vector (Promega, Madison, WI) containing the green 
fluorescent protein with the C-terminal HAHIS epitope 
affinity tag. The resulting PRSSll-Like CDS-GFPHAHIS 
(pCINeo) construct expresses the recombinant PRSS11-L 
protein with GFP and HAHIS fused to its C-terminus in a 
transfected mammalian cell line. 

A second mammalian expression construct F-L PRSS11-L 
CDS-GFPHAHIS (pCINeo) was created to examine the role of 
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the long PRSS11-L 5' -untranslated region (5'-UTR) on the 
expression of PRSS11-L in in vivo cell culture 
transfection experiments. This construct contained the 
native PRSS11-L 5' -UTR and the PRSS11-L CDS fused in 
frame at its C-termus to the GFP HAHIS module. It was 
constructed following procedures similar to those of 
PRSSll-Like CDS-GFPHAHIS (pCINeo) construct. 

EXAMPLE 4 

Purification of Recombinant Protease PRSS11-L 

The expression vector PRSSll-Like CDS-HAHISpFastBac 
was first transformed into a bacterial host cell, and 
amplified inside such a bacterial transformant cell. The 
vector was purified and verified by PCR confirmation in 
accordance with the manufacturer's recommendations. 
Subsequently, the expression vector was used to transfect 
Sf9 insect cells (ATCC CRL-1711) . Several days later, 
conditioned media containing the recombinant protease 
PRSS11-L-CDS-HAHIS baculovirus was collected for viral 
stock amplification. PRSS11-L expression was confirmed 
by Western blotting analysis of recombinant baculovirus 
infected Sf9 cells using an anti-HA MoAb (Boehringer 
Mannheim Corp., Indianapolis, IN). 

Sf9 cells growing in Sf-900 II SFM at a density of 
2X10 6 /ml were infected at a multiplicity of infection of 2 
at 27 °C. Approximately 72 h later, cells were harvested and 
homogenized in 50 mM NaH 2 P0 4 (pH 8.0) and 300 mM NaCl by 30- 
40 strokes. The homogenate was mixed with Tween-20 at the 
final concentration of 0.3 % and centrifuged for 20 min at 
15,000 Xg for 30 min. The resulting supernatant was batch- 
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bound to Ni-NTA agarose beads (Qiagen, Valencia, CA) for 3 0 
min at 4°C. The mixture was packed into a column and washed 
with NaH 2 P0 4 (pH 8.0), 300 mM NaCl, and 25 mM imidazole. 
Then, the NTA agarose bead-bound PRSS11-L CDS-HAHIS was 
5 transferred to a micro-centrifuge tube in 50 mM Tris-HCl (pH 
7.5) and 50 mM NaCl . 

Samples of cell lysates or the purified protease 
PRSS11-L CDS-HAHIS, were denatured in the presence of sodiun 
duodecyl sulfate and the reducing agent P-mercaptoethanol by 

10 boiling, and analyzed by SDS-PAGE (Bio Rad, Hercules CA) . 
Polyacrylamide gels were either stained with Coomassie 
Brilliant Blue or subjected to Western blot analysis. For 
Western blotting, gels were electrotransf erred to Hybond ECL 
membranes (Amersham, Arlington Heights, IL) . The HA- tagged 

15 PRSS11-L-CDS-HAHIS recombinant protein was detected with the 
ant i -HA 12CA5 MoAb (Boehringer Mannheim) . The secondary 
antibody was a goat-anti-mouse IgG (H+L) , horseradish 
peroxidase-linked F(ab')2 fragment, (Boehringer Mannheim 
Corp., Indianapolis, IN) and was detected by the ECL kit 

20 (Amersham, Arlington Heights, IL) . 

EXAMPLE 5 

Expression of Recombinant Protease PRSS11-L in Mammalian 

cells 

25 The two mammalian expression constructs encoding 

PRSSll-Like CDS-GFPHAHIS and F-L PRSS11-L CDS-GFPHAHIS 
were transiently transfected into HEK-293 cells. Forty 
hours later,, green fluorescence resulting from the 
PRSSll-L-GFP-HAHis fusion protein was examined by 
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microscopy, and cell lysates generated were subjected to 
Western blot analysis using the anti-HA 12CA5 MoAb 
(Boehringer Mannheim) . This experiment demonstrated that 
the PRSS11-L protein was expressed in eukaryotic cells. 

EXAMPLE 6 

Cleavage of p-Casein by Purified Recombinant Proteases 

PRSS11-L 

The serine protease activity of PRSS11-L was 
demonstrated herein. 

The full length PRSS11-L cDNA was fused to a C-terminal 
HAHIS epitope/af f inity tag in frame on a baculovirus 
expression vector. The recombinant PRSS11-L protein with 
HAHIS epitope/af f inity tag at its carboxyl terminus was 
expressed in baculovirus -inf ected cells. The recombinant 
protein was detected by immunoblot analysis using anti-HA 
antibody. Consistent with that observed for E. coli HtrA 
and human HtrA (L56) , a distinct set of lower molecular 
weight proteins is also evident by immunoblot analysis, 
implying autocatalytic activity of PRSS11-L. The 
proteolytic activity of PRSS11-L was further confirmed by 
its cleavage of casein. 

To demonstrate the protease activity of PRSS11-L 
protein, 1 pg of purified PRSS11-L CDS-HAHIS bound to Ni-NTA 
agarose beads in 10 pi was incubated with 50 jag of p-caesin 
(Sigma, St Louis, MO) in 50 mM Tris-HCl (pH 7.5) and 50 mM 
NaCl. The total reaction was 100 pi. After lh incubation 
at 37°C, the mixture was spun briefly to pellet the PRSS11-L 
CDS-HAHIS bound to Ni-NTA agarose beads, and 25 pi of the 
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supernatant was taken for SDS-PAGE analysis. The cleavage 
of p-caesin was visualized by staining with Coomassie 
Brilliant Blue. Bovine trypsin was used as a positive 
control of p-casein cleavage and negative controls represent 
5 parallel reactions of casein incubated with Ni-NTA agarose 
beads, used to purify an irrelevant protein from baculovirus 
infected Sf9 cells, also in 50 mM Tris-HCl (pH 7.5) and 50 
mM NaCl. 

10 EXAMPLE 7 

Screening for compounds that modulates PRSS11-L proteolytic 

activity 

Compounds that modulate the -serine protease of the 
present invention are identified through screening for the 

15 acceleration, or more commonly, the inhibition of the 

proteolytic activity. Although in the present case casein 
cleavage activity is monitored by a PAGE assay, chromogenic 
or fluorogenic assays or other methods such as FRET to 
measure proteolytic activity as mentioned above, can be 

20 employed. Compounds are dissolved in an appropriate 

solvent, such as DMF, DMSO, methanol, and diluted in water 
to a range of concentrations usually not exceeding 100 jjM 
and are typically tested, though not limited to, a 
concentration of 1000-fold the concentration of protease. 

25 The compounds are then mixed with the protein stock 
solution, prior to addition to the reaction mixture. 
Alternatively, the protein and compound solutions can be 
added independently to the reaction mixture, with the 
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compound being added either prior to, or immediately after, 
the addition of the protease PRSS11-L protein. 

EXAMPLE 8 

5 PRSS11-L is a novel alternative splice variant of HtrA3 

Following the provisional filing, a sequence was 
submitted within GenBank (GenBank Accession No. BC034390) 
having about 87% sequence identity with the PRSSll-Like cDNA 

10 [SEQ ID No: 1] . This sequence, which is predicted to encode 
human HtrA3, was identified from the NIH-MGC Project. Part 
of the sequence (nt 589 - 2242) shares 100% identity with 
part of the SEQ ID NO: 1 (nt 1038 - 2691). The 5' end of 
the sequence (nt 1- 588) shares less than 45% sequence 

15 identity with that of the SEQ ID No: 1 (nt 1 - 1037) . 

Although the deduced amino acid sequence (GenBank Protein^ 
Id No: AAH34390) shares 100% identity with the PRSSll-Like 
amino acid sequence [SEQ ID No: 2] at the C-terminal 
catalytic domain, the sequences have divergent N-termini and 

20 thus represent distinct polypeptides (Figure 2) . 

More recently the isolation of the human HtrA3 cDNA has 
been published (Nie, G,-Y., et al . , (2003). Biochem. J. 
371:39-48) and these investigators isolated both a long and 
short isoform GenBank Accession Nos . AY280665 and AY280666, 

25 respectively), which have alternatively spliced C-termini. 
All of these GenBank submissions contain the same extended 
N- terminus, which contains a signal sequence suggesting that 
these forms are secreted. Significantly, the PRSSll-Like 
cDNA encodes a distinct form of polypeptide lacking this 

30 extended N- terminus, but containing the same catalytic 
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domain, suggesting that it encodes a non-secreted 
intracellular isoform of HtrA3 . 

The PRSSll-Like genomic structure was determined to 
establish whether the PRSSll-Like cDNA [SEQ ID No:l] is a 
5 distinct splice variant of the HtrA3 gene. This was 

accomplished by aligning both the HtrA3 long form cDNA and 
the PRSSll-Like cDNA sequences against the genomic sequence, 
a BAC clone from human chromosome 4 (GenBank Accession No. 
AC113 611) . The sequence alignments indicated that the HtrA3 

10 long form cDNA was composed of nine exons while the PRSSll- 
Like cDNA was composed of eight exons (Figure 3) . The HtrA3 
long form cDNA and the PRSSll-Like cDNA shared seven common 
exons at their 3' -ends. At their 5' -ends, the HtrA3 long 
form cDNA had two additional exons and the PRSSll-Like cDNA 

15 had one. Exon 1 of HtrA3 was distinct from the first exon 
of PRSSll-Like, possibly due to differential use of two 
distinct promoters initiating transcription at two different 
positions along the HtrA3 gene. . Exon 2 of HtrA3 is 
identical to part of the 3' -end coding sequence of the frist 

20 exon of the PRSS11-L cDNA. Figure 4 depicts the splice 
donor/acceptor sites for human HtrA3 and PRSS11-L 
transcripts along the HtrA3 gene. Based on these sequence 
alignment results, PRSSll-Like is indeed a novel alternative 
splice variant of the HtrA3 gene. 
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